














JULY 1957
Journal of Clinical Psychology
Volume XIII Number 3














EDITORIAL BOARD 


FREDERICK C. THORNE, Editor 
Editorial Office: 5 Pearl Street 
Brandon, Vermont 


JERRY W. CARTER, JR., U. S. Public Health Service
HOWARD F. HUNT, University of Chicago
WILLIAM A. HUNT, Northwestern University
ELAINE F. KINDER, New York State Department of Mental Hygiene
MAURICE LORR, Veterans Administration, Washington, D. C.
ROBERT I. WATSON, Northwestern University





ELEANOR C. THORNE, Business Manager 

















Journal of Clinical Psychology 


5 Pearl Street, Brandon, Vermont








COMPANION VOLUMES 


PRINCIPLES OF PSYCHOLOGICAL EXAMINING
by FREDERICK C. THORNE, M.D., Ph.D.
Editor, Journal of Clinical Psychology
PRICE: $7.50

PRINCIPLES OF PERSONALITY COUNSELING
by FREDERICK C. THORNE, M.D., Ph.D.
Editor, Journal of Clinical Psychology
PRICE: $6.00





SYSTEMATIC: The first comprehensive system of diagnosis and psychotherapy based
solidly on basic psychological science.

THEORETICAL: Presenting a comprehensive theory of the nature of personality
integration with special emphasis on the differential diagnosis and modification
of the primary etiologic factors which actually organize socially significant
behaviors.

ECLECTIC: All pertinent theories and facts in basic science psychology and clinical
psychological science are related in a systematic applied integrative psychology.

COMPLETE: For all teaching and reference purposes.

ADVANCED: Both of these books present a new orientation to diagnosis, counseling
and psychotherapy which is more than just a rehash of the literature. Principles 
of Psychological Examining is more than a textbook of psychometrics since 
tests and measures are presented only as they relate to more fundamental prob- 
lems of psychological diagnosis. Similarly, Principles of Personality Counseling 
presents thorough discussions of the indications and contraindications for apply- 
ing all known methods of psychotherapy after a primary diagnosis of factors 
requiring modification has been accomplished. 





ORDER NOW 





JOURNAL OF CLINICAL PSYCHOLOGY 
5 Pearl Street Brandon, Vermont 








JOURNAL OF 
CLINICAL PSYCHOLOGY 


Vol. XIII July 1957 No. 3








CONTENTS 
GENERAL REVIEWS 


Research in Clinical Psychology: Factor Analytic Contributions. W. GRANT DAHLSTROM

The Conceptual and Test Distinction of Neuroticism and Anxiety. RAYMOND B. CATTELL

Critique of Recent Developments in Personality Counseling Theory. FREDERICK C. THORNE





DIAGNOSTIC PROCESSES: GENERAL

The Construct Validity of the Sixteen Personality Factors Test. SAMUEL KARSON and KENNETH BRYNER POOL

Confirmation of Nine Psychotic Symptom Patterns. MAURICE LORR, JAMES P. O’CONNOR and JOHN W. STAFFORD

The Loyola Language Study. VINCENT V. HERR

Measuring the Impairment of the Abstracting Function with the Proverbs Test. CLYDE M. ELMORE and DONALD R. GORHAM


Evaluation of Intellectual Potential in Hemiplegic Individuals. FRANKLIN C. SHONTZ


The Prediction of Adjustment over a Four Year Interval. HARRY BEILIN





DIAGNOSTIC PROCESSES: RORSCHACH STUDIES


Factor Analysis of Rorschach Scores of Schizophrenics. BERNARD A. STOTSKY


Effects of Different Methods of Structuring the Rorschach Inquiry upon Determinant Scores. STANLEY W. KLINGENSMITH


Psychometric Scales for the Rorschach Popular Response. DOROTHY PARK GRIFFIN





DIAGNOSTIC PROCESSES: SELF AND PEER PERSONALITY RATINGS

Self Congruence, by Sex and Between the Sexes, in a “Normal” Population. JOHN G. MARTIRE and ROBERT H. HORNBERGER

Similarity of Personality in the Sociometric Preferences of Mental Patients. WILLIAM G. SHIPMAN

Self and Peer Personality Ratings of Psychotherapists. ELI A. RUBINSTEIN and MAURICE LORR








CONTENTS—Continued 


The Scatter Pattern of a Southern Negro Group on the Wechsler-Bellevue Intelligence Scale. JULIAN C. DAVIS


The Utilization of a Psychologist in Ward POs I ae WarrEN W. WEBB 
and HIRAM L. GORDON


Evaluation of Modified Administration of the California Psychological Inventory. LAWRENCE A. BENNETT and ALVIN RUDOFF

Seasonal Changes in Admission Rates of a State Hospital. PAUL A. HAUCK

Personality Changes Following Transorbital Lobotomy. NANCY B. ALGEO and MAXWELL S. PULLEN





Editorial Opinion

Book Notices


GENERAL INFORMATION 


The Journal of Clinical Psychology is an independent journal dedicated to the advancement of the 
clinical method in psychology. Although primarily a scientifically oriented professional journal limited 
to publication of original research reports and authoritative theoretical articles, it aims to foster the 
promotion and expansion of clinical psychology as an applied science. Special sections in each issue 
will be devoted to news, comments, and letters to the editors on matters pertinent to the profession. 
Communications and articles submitted for publication are welcome and should be addressed to the 
Editorial Office, 5 Pearl Street, Brandon, Vermont. 


ORIGINAL ARTICLES are published only with the understanding that they are contributed 
exclusively to the Journal of Clinical Psychology. The publishers are not responsible for statements 
made or opinions expressed by contributors in articles printed in these columns. Copyrights cover 
publication in the Journal of Clinical Psychology and articles may not be reproduced without
permission of the publishers. 


MANUSCRIPTS must be typewritten, double-spaced with good margins, on one side of the 
paper only. The publishers will not be responsible for manuscripts lost in the mails or through un- 
avoidable accident. Retain a carbon copy after sending the original to the editor. It is requested that 
all articles be as brief as consistent with clarity. All manuscripts are subject to editorial modification. 


THE STYLE of all manuscripts including ene must be in accordance with the in- 
structions given in: Anderson, J. E., and Valentine, W. L. The preparation of articles for publication 
in the journals of the American Psychological Association. Psychol. Bull., 1944, 41, 345-376. Refer- 
ences should be arranged alphabetically according to the directions given above. Full address of the 
author should appear somewhere in the article, preferably at the a 


REPRINTS are furnished on order only and must be requested on the order blank provided 
when galley proofs are returned. Change of address notices should be sent one month before moving 
and should include old and new addresses. Undelivered copies will be remailed only at subscribers’ 
expense. 


The Journal of Clinical Psychology is published quarterly, appearing in January, April, July and 
October, at Rutland, Vermont, by Frederick C. Thorne, M.D. Subscription price $8.50; Canadian 
subscriptions $9.00; foreign subscriptions $9.50 including postage; single copies $2.50. Entered as 
second class matter January 15, 1945, at the post office at Burlington, Vermont, under the Act of
March 3, 1879. Reentered at the post office at Brandon, Vermont. Additional entry at the post-office
at Rutland, Vermont. Title registered, U. S. Patent Office. Editorial and Business Offices, 5 Pearl 
Street, Brandon, Vermont. 


Copyright, 1957, Frederick C. Thorne, M. D. 





RESEARCH IN CLINICAL PSYCHOLOGY: FACTOR ANALYTIC 
CONTRIBUTIONS 


W. GRANT DAHLSTROM 


University of North Carolina 


GENERAL THEORY 


Factor analytic techniques are having a significant and growing impact on re- 
search in clinical psychology, both from the number of publications in recent years 
in which factor analytic methods have been employed, and from the contribution 
factor analytic models have made to the conceptual framework in which ability, 
temperament and personality are viewed. Clinicians cannot comfortably remain in 
ignorance about the conceptual, methodological or logical ramifications of factor 
analysis and still try to keep up with the substantive contributions stemming from 
the application of these techniques. In this review, an attempt will be made to epi- 
tomize the factor analytic approach, describe some of its limitations and survey some 
of the major theoretical issues being subjected to factor analytic study now. 


An excellent non-mathematical treatment of the logic of standard factor analysis
has been provided by Hall and Lindzey. Only a few major points can be
made here. Variation is the basic concern of any science, the origin of explanatory 
schemas and the source of criteria for predictions. Two sources contribute variation 
for the study of personality, inter-individual and intra-individual differences. The 
assumption is made that differences between various persons reveal the operation of 
important variables which are pertinent in forming the personality characteristics of 
any one person, even though such characteristics may not be constant. Differences 
arising from changes in the reactions of an individual over a period of time similarly 
disclose factors affecting less permanent personological states. Personality descrip- 
tion is made in terms of these comparisons; scientific study is directed to the search 
for the relevant determiners of these variations. Factor analysis proceeds on the 
assumption that this scientific study is facilitated by an intermediate step of seeking 
descriptive economy in a small set of variables labelled variously traits, abilities, or 
factors. These few variables are not offered necessarily as explanations or as bases 
of prediction, but as useful alternatives to the vast array of possible comparisons 
arising from combinations of various specific values of these factors. Thurstone
in a general characterization of this step described it as an attempt to reduce sig- 
nificantly the number of degrees of freedom in the construct set from the number in 
the set of original observations. 

The classic scientific method calls for the direct manipulation of one variable, 
the independent one, and the close scrutiny of its effect on another, the dependent 
variable, while all else is held as constant as possible. In factor analysis, too, the 
emphasis is on covariation. The operation of a variable is to be observed, not by 
directly manipulating it, but by allowing it to vary and scrutinizing the correlations 
it generates amongst a large set of observations. Some writers say that factor an- 
alysis enables the investigator to examine simultaneously the dependencies among 
a large set of variables, and through statistical rather than manipulative control, 
run several “experiments” at the same time. Thus, factor analysis is offered as the 
method of choice when the investigator is in considerable ignorance about the area 
under study; ignorant, that is, about the variables, the relationships, and the requis- 
ite manipulations. It is also offered as suitable when the requisite manipulations are 
prohibited, as in the areas of mental illness and intellective abilities. 

In this context of sheer exploration, several assumptions are advanced as first 
approximations only (cf. Guttman). Consideration is given only to variables
which are linearly related in the observational set. Also, many factor analysts 
assume orthogonal independence amongst the factors in the construct set. In addition,
the usual specification equation for any one individual’s performance on any
one observational variable (test) is based on a simple additive combination of products
of factor weights of task and person. These are tentative decisions to be used
for theory generation and to be changed when the evidence forces some alternative 
mathematical form. Working assumptions such as these hypotheses provide are 
necessary, but some additional requirements must also be met if the investigator 
is to be able to proceed beyond these first approximations. There must be a way of 
knowing when one is dealing with the same source of variance in several different 
investigations, so one may determine whether some non-linear relationship is opera- 
tive in the construct set. There must exist some sampling model to be used as a 
guide as to when to reject an assumption of linearity or simple additivity. Most im- 
portantly, there must be freedom from implicit assumptions in procedure that force 
the data into the prescribed form. In this last respect, factor analytic methods carry 
over from the ability domain many hidden assumptions about the dimensions, their 
reality status and their interrelationships. 
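
The specification equation described above can be written out as a short sketch. It assumes two orthogonal factors and uses purely hypothetical weights; the names `person_factors` and `test_loadings` are illustrative and do not come from any of the studies cited. Each predicted score is simply the sum, over factors, of the person's factor weight times the test's factor weight.

```python
import numpy as np

def predicted_scores(person_factors, test_loadings):
    """Linear, additive specification equation.

    score[i, j] = sum over k of person_factors[i, k] * test_loadings[j, k]
    """
    return person_factors @ test_loadings.T

# Hypothetical weights: 3 persons and 4 tests on 2 orthogonal factors.
person_factors = np.array([[1.0, 0.0],
                           [0.0, 1.0],
                           [0.5, -0.5]])
test_loadings = np.array([[0.8, 0.0],
                          [0.6, 0.3],
                          [0.0, 0.7],
                          [0.4, 0.4]])

scores = predicted_scores(person_factors, test_loadings)
print(scores)
```

A non-linear or interactive relationship between person and task would require replacing the matrix product with some other function, which is the kind of alternative mathematical form the text says the evidence may eventually force.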


THe Domain oF FAcTORIAL STUDIES 


It is regrettable that factor analysis was not advanced as a special instance of 
analysis of variance. As these general procedures are now described, no one would 
offer as reasonable investigative behavior the blind partitioning of the variances ob- 
tained from sets of observations. The guiding principle in the partitioning of the 
experimental data in analysis of variance stems from psychological theory and from 
the observations or manipulations of the independent variable that are generated by 
the theory. The statistical considerations are neutral in regard to the theory in- 
volved. Factor analysis provides a way of partitioning the covariational data, but 
the lines of segmentation should come from a careful articulation of theory and ob- 
servation. No procedure, even rotation to simple structure, should be exercised 
without consideration of the theoretical implications of the set of observations. Many 
writers have implied that factor analysis is a useful procedure only in the stage 
where the investigator has no theory and that it loses its usefulness when the in- 
vestigator becomes an experimental manipulator. This seems wrong in two respects: 
a great deal needs to be known to use factor analysis appropriately in an area, and 
the more known the better, while there is no reason to believe that the adroit use of 
factor analysis cannot help clarify problems in areas where direct manipulation has 
long been employed with great power (cf. Lewis, McAllister, and Bechtoldt).

The first necessity for detailed knowledge in the application of factor analysis 
to a problem area is in the definition of the domain itself. The problems involved 
here in terms of the arbitrariness of the decisions and their effect on the subsequent 
findings can be illustrated by a familiar analogy. 


Let us consider a psychologist using the well known Object Sorting Test of Goldstein and
Scheerer. In laying out the materials in front of the subject, the investigator may choose, say,
several pieces of cutlery, a pencil, a letter opener, an ashtray, a pipe, and two cigars, one of
tobacco and the other of candy. With this particular array of objects, the sorter might, if he were
taking a variety of abstract attitudes, place the objects in classes which involve cutting edges, 
objects which pertain to smoking, objects made of metal, etc. He also might form a class of ob- 
jects which are placed in the mouth. It is clear that the particular concepts which the subject is 
most likely to come up with are a function of the particular objects presented. We are more likely 
to get the subject to sort the objects into classes relating to smoking if, in addition to the cigar, 
pipe, and ashtray, we add the matchbox and matches. In this case, the candy cigar might not be 
placed in this class. However, if we add some objects not generally employed in the test like a 
candy bar and a penny sucker we would be much more likely to get the subject to sort out a class 
of confections. In fact, we could change the behavior of the subject rather drastically if we added 
not just one candy bar but a variety of candy bars, differing in presence of chocolate, nuts, caramel 
or even went further and added a variety of forms of confections sampling widely from a com- 
plete candy store. Under these circumstances the subject may well form a variety of groups of 
the confections, meanwhile placing all of the non-candy items into one class. 


The various factor analytic methods advanced, including the centroid method of 
Thurstone with rotation to simple structure, prove to be as sensitive to the “elements”
which are placed in the matrix as is the behavior of the hypothetical test subject
above. If in our ignorance of a psychological domain we include only one test which 
reflects some particular source of variance, x, then this source of variance will not 
appear in the common factor matrix. In order for the factor x to appear we would 
have to include several measures of this source of variance; from the resulting inter- 
correlations amongst these several x-related measures, we would get common factor 
space reflecting this variance. In the very early stages of investigation of an area, 
however, the psychologist is apt to omit a particular line of study from his matrix, or 
represent it inadequately, so that the common factor space does not include this 
variable. Similarly, when additional observation variables are added to the matrix, 
the relationships amongst the factors are altered. Thurstone has advanced the
simple structure solution to provide such metric invariance. It was also intended to 
provide configural invariance in the factor solutions when the same observations are 
made on different populations of subjects. Recent publications have raised doubts 
that invariance in simple structural solutions can be assured in either of these situations.
As one example, Fleishman and Hempel, by adding tasks to a psychomotor
battery, split a previous factor, Psychomotor Coordination, into two factors, one 
emphasizing fine, highly controlled adjustments, the other involving gross co- 
ordination of more extensive movements. Addition of tasks to a battery not only 
may extend the common factor space by bringing into consideration variance pre- 
viously represented only on one task, but may accentuate differences in variance 
hitherto assigned to one source. 
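
The dependence of the common factor space on the tests chosen can be illustrated with a small numerical sketch. It assumes orthogonal factors and hypothetical loadings (none of the figures come from the studies cited) and builds the correlation matrix those loadings imply. With a single test of source x, that test correlates zero with everything else, so x cannot enter the common factors; once two further x-related tests are added, intercorrelations appear.

```python
import numpy as np

def model_correlations(loadings):
    """Correlation matrix implied by orthogonal common factors.

    Off-diagonal entries are sums of products of loadings; the diagonal
    is set to unity (common plus unique variance).
    """
    r = loadings @ loadings.T
    np.fill_diagonal(r, 1.0)
    return r

# Hypothetical battery: three tests of a first source, one test of source x.
one_x = np.array([[0.8, 0.0],
                  [0.7, 0.0],
                  [0.6, 0.0],
                  [0.0, 0.7]])
r_one = model_correlations(one_x)
x_row_one = np.delete(r_one[3], 3)      # correlations of the lone x test

# Same battery with two more x-related tests added.
three_x = np.vstack([one_x, [[0.0, 0.8], [0.0, 0.6]]])
r_three = model_correlations(three_x)
x_row_three = np.delete(r_three[3], 3)  # same test, enlarged battery

print(x_row_one)
print(x_row_three)
```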


A similar point can be made from the study of Kettner et al. in which Thurstone’s
simple structure was not employed but in which a test, Problem Solving,
failed to show its familiar relationship with a general reasoning dimension, pre- 
sumably because the task was too easy for the selected group of Coast Guard cadets. 
In both of these studies cited above, the departures from expectation had to be 
interpreted qualitatively in the light of past experience with the materials and not on 
the basis of clear-cut evidence of greater than chance variation in factor loadings. 
The solution to this problem would appear to lie in the judicious, theory-based use 
of reference tests about which a great deal is already known, together with appropriate
sampling models along the lines suggested by Lawley or Rao.


A further point should be made about this problem. Formal means for compar- 
ing factor analytic solutions from separate studies have not been developed in detail. 
Vandenberg, for example, studied the effects on a group of abilities and their
intercorrelations when translated into another language. He gave thirty-five tests
to bilingual Chinese students and compared the factor structures obtained in English
and Chinese versions. He based his comparisons on Tucker’s φ, an index of
similarity between factor structures. Ahmavaara also has developed a method
he calls transformation analysis for direct comparison of factor solutions from sep- 
arate studies. Unfortunately, these indices must be interpreted qualitatively as yet 
since they lack sampling specifications. 
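
An index of similarity of the kind attributed to Tucker is commonly computed as a coefficient of congruence between two columns of factor loadings. The sketch below assumes that form; the loadings are hypothetical and the function name `congruence` is illustrative. It also makes the limitation noted above concrete: the function returns a single number with no accompanying sampling distribution, so any cut-off for "the same factor" remains a matter of judgment.

```python
import numpy as np

def congruence(a, b):
    """Coefficient of congruence between two loading vectors.

    Sum of cross-products divided by the square root of the product
    of the two sums of squares.
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(a @ b / np.sqrt((a @ a) * (b @ b)))

# Hypothetical loadings of the same three tests in two studies.
proportional = congruence([0.8, 0.6, 0.1], [0.4, 0.3, 0.05])
unrelated = congruence([1.0, 0.0], [0.0, 1.0])
print(proportional, unrelated)
```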

In this problem of the definition of the domain, it has usually been assumed 
that as the knowledge of an area accumulated, the domain would gradually be re- 
defined and appropriately clarified. This hope is certainly a reasonable one, but the 
usual factor analysis does not provide a seminal basis for such restructuring of the 
area. The difficulty is that in the factor plots, the dimensions do not differ in ways 
that would suggest the levels of conceptualization required to fit the observations
into a broader theoretical framework. In the ability area, some innovations have 
been introduced that stemmed mainly from factor analytic surveys, but a great deal 
of credit historically should go to the non-factor analytic work of Binet, of Terman, 
and of Pintner, for example, in clarifying the proper area of study in the intellective 
domain. The necessity for empirical ties with non-test criteria has been overlooked 
in much factor analytic work on personality tests, to the detriment of many carefully
constructed instruments. This requirement is explicitly recognized by Eysenck
in his criterion analysis step prior to further factor analytic work.


THE INTERPRETATION OF FACTOR PLOTS


The lack of inherent ‘‘meaning”’ in the factor plots is not necessarily a limitation. 
No one is worried that the error term in an analysis of variance does not provide in 
itself clues as to the sources of uncontrolled variation. The point is that workers 
have advanced factor analysis techniques and results as somehow theory-generating. 
Currently there is a great deal of concern about the contribution of the examiner to 
variation in personality test performance. The examiners may well show up in a 
factor analysis as a source of variation amongst a set of variables, but this “dimension”
would not be in any way different than the effects of a certain response set or
instructional difficulty, when represented in the factor plots. 

Although some writers advance factor analytic results as mere descriptive 
economy, many write and act upon implicit or explicit existential hypotheses con- 
cerning the factors. For them the factors are there, the investigator has merely to 
find them. A careful treatment of these dangers of reification in psychological constructs
can be found in Bergmann, but a few comments are pertinent here in the
context of factor analytic work. The issue is subtle but important since one method 
of factor analysis or another, or one method of rotation or another, is advanced because
it will provide the identification of the “real” factors.

A form of this issue appears in the work of Rosenthal on the problem of
anxiety. For her purposes she employed 54 variables alleged to measure anxiety in a 
long list of studies in the area. She posed her problem as an attempt to determine 
“whether one concept is referred to in these studies” or not. She included question- 
naire data, physiological measures, conditioning and learning variables. She inter- 
correlated these measures on 70 students and performed a centroid factor analysis 
with a rotation to simple structure. In one sense, she could have answered her 
question directly by inspection of the intercorrelational matrix. If all the variables 
did not fall within the positive manifold, that is, if all the variables were not signi- 
ficantly and positively correlated, she would have had an answer to her problem. 
She found that they did not. Secondly, she just as legitimately could have employed 
a centroid factor analysis without rotation. Examination of the first factor loadings, 
the way that each of the measures was related to the general mean of the test vectors, 
would have demonstrated the usefulness of a single construct. Since she went to 
some pains to be exhaustive in regard to the important measures advanced for anxiety,
she also could have justified the use of Hotelling’s principal component
method (placing values of unity in the diagonals in the matrix) and the values of each 
of the measures of the first principal component would have been equally informa- 
tive. Since, however, she did extract 11 factors and rotated these to an oblique struc- 
ture, the question under study should probably have been addressed to a second 
order factor analysis of the loadings on these 11 factors. Granted that particular 
errors or systematic non-anxiety variance enters each or all of the measures in her 
matrix, her attention really should be directed toward the role of one central second 
order factor amongst her group factors. If such a single source of variance was found 
this would be a legitimate definition of anxiety for her purposes. It is unfortunately 
impossible to tell from her excellent abstract how much attention she paid to these 
considerations in her full report. 
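
The three simpler checks suggested in this paragraph can be sketched on a small hypothetical correlation matrix. Two simplifications are assumed and should be read as such: the positive manifold test looks only at the sign of the correlations, not at their significance, and the first centroid loadings are computed with unities in the diagonal rather than with communality estimates. The principal component follows the unit-diagonal convention mentioned in the text.

```python
import numpy as np

def positive_manifold(r):
    """True if every off-diagonal correlation is positive (sign only)."""
    off_diagonal = r[~np.eye(len(r), dtype=bool)]
    return bool(np.all(off_diagonal > 0))

def first_centroid_loadings(r):
    """Column sums divided by the square root of the grand sum."""
    return r.sum(axis=0) / np.sqrt(r.sum())

def first_principal_component(r):
    """Loadings on the largest principal component of r (unit diagonal)."""
    values, vectors = np.linalg.eigh(r)   # eigenvalues in ascending order
    loadings = vectors[:, -1] * np.sqrt(values[-1])
    return loadings if loadings.sum() >= 0 else -loadings

# Hypothetical intercorrelations among three alleged anxiety measures.
r_pos = np.array([[1.0, 0.5, 0.4],
                  [0.5, 1.0, 0.3],
                  [0.4, 0.3, 1.0]])
r_mixed = np.array([[1.0, 0.5, -0.2],
                    [0.5, 1.0, 0.3],
                    [-0.2, 0.3, 1.0]])

print(positive_manifold(r_pos), positive_manifold(r_mixed))
centroid = first_centroid_loadings(r_pos)
pc = first_principal_component(r_pos)
print(centroid, pc)
```

Uniformly substantial loadings on the first centroid factor or first principal component would support a single construct; a matrix like `r_mixed` answers the question in the negative before any factoring is done.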

These implications concerning the reality status of the factors also appear again 
and again in the modifications of factor analytic methods involving cluster search or 
type-finding techniques (cf. Sakoda). One of the recent modifications receiving a
great deal of attention is the D function of Cronbach and Gleser which expresses
the similarity between test patterns. As Lykken comments about this statistic,
“Only by remote coincidence would a measure of the dimension of geometrical dis- 
tance serve also as a metric for a psychological dimension, and a suitable index of 
similarity for (some) one attribute could not, by definition, measure similarity in 
(some) other attribute if the two are truly independent.” Many factor studies seem 
to be bound by servile conformity to some particular mechanical solution in the false 
belief that in some magical way the method will uncover the hidden, basic truths. 
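
Lykken's objection can be made concrete with a minimal sketch. D is taken here in its simplest form, the Euclidean distance between two score profiles; the profiles are hypothetical and the function name `profile_distance` is illustrative. Two persons can lie at exactly the same distance from a third while differing from him on entirely different attributes, so the single index cannot say in which respect the profiles are dissimilar.

```python
import numpy as np

def profile_distance(p, q):
    """Euclidean distance between two score profiles."""
    diff = np.asarray(p, dtype=float) - np.asarray(q, dtype=float)
    return float(np.sqrt(diff @ diff))

# Hypothetical two-attribute profiles.
reference = [0.0, 0.0]
differs_on_first = [3.0, 0.0]
differs_on_second = [0.0, 3.0]

d1 = profile_distance(reference, differs_on_first)
d2 = profile_distance(reference, differs_on_second)
print(d1, d2)
```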


FACTOR ANALYSIS AND PSYCHOLOGICAL THEORY


The full and appropriate application of factor analytic techniques is further 
hindered by its historical relationship to a sterile, clumsy trait theory. Work in the 
area has not led as yet to anything resembling, in Bergmann’s terms, process laws
of psychological variables, nor to any serious consideration of the combinatorial
problems which await empirical investigation. Much of this work has suffered from
the inherent difficulties of what Bergmann and Spence have termed R-R formulations
which better serve taxonomic and classificatory purposes than the aims of
generating theory. The taxonomic problem may be resolved by a multiple correla- 
tion and its related beta weights, or a linear discriminant function, but the simple 
addition of variables according to some geometric model or vector schema does not 
meet the requirements of behavior theory at the current level of sophistication. The 
construct of anxiety, mentioned above in the study of Miss Rosenthal, serves as a 
good example of this point. Factor analytic methods may well furnish evidence and 
information that would lead to some construct like anxiety as a source of individual 
differences. In fact, the description Miss Rosenthal gives of her factor 1 is very 
close to Taylor’s Manifest Anxiety Scale or Welsh’s Judged Anxiety Scale and his
first Factor Scale, A. But there is little basis within the traditional use of factor an- 
alysis to expect that the additional specifications about its conditionability (and 
extinguishability), its drive properties, its cue-producing function, its role in the 
reinforcing process, its enhancement of behavioral efficiency under some specific 
conditions and its inhibitory or disruptive effects on on-going behavior under slightly 
altered conditions could easily be obtained merely from continued factor analytic 
exploration of the domain. These important considerations have been assigned to 
anxiety as a consequence of the introduction of this construct into a framework of 
behavior theory according to clearly specified rules of scientific theory building. 
Many of these interrelationships are not monotonic, few of them are simply additive, 
many of them are highly dependent upon momentary quantitative values of both
the anxiety variable and other constructs in the system. The great importance these
problems of personality organization have in current research (cf. Thorndike)
makes it all the more deplorable that factor analytic work continues without proper
ties with a sophisticated psychological theory (cf. Franks).


FACTOR STUDIES OF W-B PATTERNING


One of the most widely studied problems in current clinical research has been 
the problem of Wechsler-Bellevue patterning. Some of the key issues in this area 
could have been cast in a form in which factor analytic methods might have been 
usefully applied. One form, spelled out in most detail by Jastak in his concentric
method of test analysis, was that all of the forms of intellective behavior sampled 
(on the subtests of the W-B, for example) reflect one basic ability and therefore a 
person with a given ability should be expected to have the same standing on any of 
the measures. When he does not, the deficiencies appearing on some of the subtests 
reveal areas in which he has not been developed to full capacity. The level one can 
expect the person to develop is set at the level of the highest performance in the 
battery, i.e., Jastak’s altitude score. Several problems are involved in the interpretation
of test profiles in this way (cf. Brown and Bryan), including the fact that even
the highest value could itself also be “depressed” to an unknown degree and the fact
that this whole approach ignores the problem of errors of measurement, which are 
undoubtedly different for each of the subtests. But the crucial question is whether 
each of the subtests can be considered a homogeneous measure of two sources of 
variance: a source common to all of the subtests and a source specific to the subtest 
itself. If not, the concentric model cannot be appropriately applied, and the infer- 
ences drawn from the profile must be considered erroneous. 

The analysis of Davis was based on a matrix which included not only the
W-B subtests but also reference tests from Air Force studies and other sources that 
were well known in terms of their relationships within the intellective domain. He
used these reference tests to help determine his final rotational solution and found
that in these terms, all of the subtests of the W-B were reflecting multiple sources of 
variance for this junior high school population. The vocabulary, arithmetic, and 
picture arrangement subtests were the least complex. The comprehension subtest
was found to be very complex in his solution but even so the amount of its variation 
reflected in this battery was small. Each of the sources of variance from this common 
factor space in the comprehension subtest had a higher representation on some other 
subtest in the W-B. From data like those provided by Davis, considerable doubt 
exists that the W-B meets the assumptions listed above for the “single-factor” interpretation
of the profile.

Another form of the patterning research on the W-B is that all of the variance 
in the subtests that is not attributable to the intelligence dimension for which the 
W-B was constructed is a function of different non-intellective sources of variance 
that are important in evaluating the subjects’ personality status. This question, too, 
is easily submitted to factor analytic scrutiny. Frank approached this question by
means of a Q-technique analysis in which persons are correlated, rather than the
standard R-technique in which tests are correlated. Burt has demonstrated the
algebraic relationships existing between these two procedures when they are based
on the same tests and subjects. Stephenson has extended the procedures based
on the correlation of persons beyond this by using samples in which the interchange- 
ability mentioned above is no longer possible. The procedure of correlating persons 
involves the reduction of the separate measurements involved in the correlations to 
standard scores, which has been done in the weighted scores of the W-B. Frank took
the additional step, needed when the number of observations is small, of using
rank-order correlations for his original matrix. His subjects came from the work of
Rapaport et al.“ and included 60 neuropsychiatric patients differing in sex, age,
intelligence, and psychiatric diagnosis. He found two dimensions, which in p-technique
appear as types of persons. These groupings did not show any homogeneity in respect 
to age, sex, or diagnosis. Rather, the sources of variance were most economically 
ascribed to performance and verbal abilities. In his brief report, insufficient data 
were provided to enable the reader to judge the evidence he used for stopping at two 
dimensions but if he appropriately followed Thurstone in the application of the 
centroid method of factor analysis, little additional variance remained in the inter- 
correlational matrix after extracting these two factors. Inspection of the magni- 
tudes of the loadings in his factor table would seem to support this. This evidence is 
consistent with the findings of Wittenborn and Holzberg“* and should be sobering 
to those who are prone to generate elaborate speculations on the basis of highs and 
lows in ability profiles on psychiatric cases. 
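The correlation of persons described above can be illustrated with a minimal sketch. It assumes that each subject's subtest scores are already on a common weighted scale and computes a rank-order (Spearman) coefficient between the profiles of two subjects. The three profiles and all function names are hypothetical illustrations, not Frank's data or his exact procedure.

```python
import math


def ranks(values):
    """Return 1-based ranks, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a profile with no variation cannot be correlated")
    return sxy / math.sqrt(sxx * syy)


def spearman(x, y):
    """Rank-order correlation: the product-moment r of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical weighted scores on six subtests (three verbal, three performance).
person_a = [12, 11, 13, 7, 6, 8]   # verbal scores higher
person_b = [13, 12, 14, 8, 7, 9]   # same profile shape as A at a higher level
person_c = [7, 6, 8, 12, 11, 13]   # performance scores higher

print(spearman(person_a, person_b))  # same shape, so the coefficient is high
print(spearman(person_a, person_c))  # opposed shape, so the coefficient is negative
```

Because only the shape of each profile enters the coefficient, two persons of different overall level can still correlate highly, which is why factors from such a matrix appear as types of persons.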

A study of differences in the interrelationships existing among various intellect- 
ive measures in groups of children of different socio-economic levels which was re- 
ported by Mitchell “ has bearing on the very important question of non-intellective 
biases in our diagnostic instruments. He employed a direct measure of S-E status for 
grouping his subjects that were initially drawn from schools which were known to 
differ in their sampling characteristics on this dimension. He employed an array of 
well-constructed reference tests from the Psychometric Laboratory of Thurstone 
and the short-form version of the California Test of Mental Maturity. He also in- 
cluded two of the kinds of tests most criticized by Davis and Hess“® as biased in 
favor of middle class groups, the Otis and the Henmon-Nelson. One of his most im- 
portant findings concerned the higher interrelationships amongst the usual ability 
variables in the lower S-E groups, suggesting less differentiation amongst the re- 
quisite skills. For the low S-E group the verbal features of the tasks posed the 
problem, either in the instructions themselves or in the wording of the item, so that 
these tests were just variations on reading and verbal comprehension tasks. This 
difference in structure was not eliminated by taking sub-samples from his two 
groups that were matched for scores on the Otis but preserved the original S-E 
differences in the total groups. The question of the possible influence on these 
structures of differences in variances on the original measurements was also explored
and dismissed. The influence of this lack of ability differentiation on some of our 
standard diagnostic instruments should be investigated. 


FACTOR ANALYTIC STUDIES OF PSYCHIATRIC DIAGNOSTIC SYSTEMS


The importance of the psychiatric diagnostic system as a source of criteria in 
test construction, pattern analysis and evaluations of therapeutic techniques makes 
the clarification of this domain by factor analytic methods of particular interest. A 
great deal of the work along this line from Moore “° on has been based upon symp- 
toms or characteristics reported by observers or therapists. The problems of lack of 
reproducibility of these original observations and the lack of experimental
independence amongst the set of ratings or judgments made by a single observer on a
single assessment sample have been particularly plaguing. Fiske °?) has pointed out 
very cogently that the factor dimensions may reflect to an unknown extent the halos 
and stereotypes of the raters and have little reference to the behavior patterns of the 
patients under study. A recent exchange of comments between Conger et al. “?» 1%) 
and Stephenson “’) has served to point up the way rater variables have confounded 
the work of Beck on schizophrenic types“. Since Beck included correlations in the 
matrices to be analyzed that were importantly a function of the raters themselves, 
the particular groupings of patients he formed amongst the schizophrenics he 
studied must be examined carefully and subjected to independent confirmation on 
new groups of cases and observers. 

Another integrated series of projects in the psychiatric domain that has led
to the espousal of three main dimensions of variation is that of Eysenck and his 
coworkers '* 1% 2 21). They have considered these taxonomic dimensions to be 
extraversion-introversion, neuroticism and psychoticism. Workers in the psychiatric 
area are generally surprised that Eysenck is willing to settle for so few basic dimen- 
sions in a domain in which most observers have felt the need for many more ways of 
describing the observed differences. The way that this economy is achieved can be 
seen from a few examples from Eysenck’s analyses. For him, “neurosis” is mainly
dyspepsia and hypochondriasis“* »- *-*), while hysteria is comprised largely of 
“extraversion.” Psychasthenia (obsessional) is mostly “introversion”; little neurotic
loading appears for either of these latter two diagnostic designations. Combinations
would be used to describe depression, anxiety, and the like. 

There has been some question, however, whether Eysenck’s economical system 
was more a function of the limitations of the material included in his matrices than 
any revolutionary way of recasting the domain of psychiatric nosology. Recent 
evidence on this point comes from a study from the Maudsley group. Trouton and 
Maxwell ©?) studied a large group of male psychiatric patients and obtained data 
from a checklist of observations routinely recorded on all cases admitted to the unit. 
They selected 45 variables which would cover the domain as specified by Cattell 
and intercorrelated them by tetrachoric methods after assuring themselves that each 
item occurred in at least ten percent of the group. Six factors were extracted by 
Thurstone’s centroid method before the residuals approached the chance level. The 
authors identified two of these sources of variance with Eysenck’s neuroticism and 
psychoticism, but found such important considerations in psychiatric status as 
thought disorder, depression, social withdrawal, and possibly hypochondriasis and 
obsessionality appearing in important degrees on the four other dimensions. After 
a careful consideration of some of the sources of bias in this method, such as semantic 
relations amongst synonyms and antonyms, the fluctuations in behavior over an 
entire period on the psychiatric service, and arbitrary separation of similar features 
because of psychiatric formalities in mental status description, the authors con- 
cluded that neither the usual psychiatric diagnostic system nor the first order ortho- 
gonal factor structure provided a clear framework for the classification of the ob- 
servations. They expressed some hope, however, that such a schema could come 
from a second-order analysis amongst the rotated first order factors. 
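The extraction step mentioned above can be sketched for a single factor. The code below is a minimal illustration of the first step of the centroid method, assuming a small hypothetical correlation matrix with unities in the diagonal (in practice communality estimates are usually placed there); it is not the analysis of Trouton and Maxwell. Later factors, which require reflecting the signs of some variables in the residual matrix, are not shown.

```python
import math


def first_centroid_factor(r):
    """Return first-centroid loadings and the residual matrix.

    r is a square correlation matrix given as a list of lists.
    Each loading is the column sum divided by the square root
    of the sum of all entries in the matrix.
    """
    n = len(r)
    column_sums = [sum(r[i][j] for i in range(n)) for j in range(n)]
    total = sum(column_sums)
    if total <= 0:
        raise ValueError("matrix total must be positive before extraction")
    root = math.sqrt(total)
    loadings = [c / root for c in column_sums]
    # Remove the variance accounted for by the factor.
    residual = [
        [r[i][j] - loadings[i] * loadings[j] for j in range(n)]
        for i in range(n)
    ]
    return loadings, residual


# Hypothetical intercorrelations among three variables.
example = [
    [1.0, 0.6, 0.5],
    [0.6, 1.0, 0.4],
    [0.5, 0.4, 1.0],
]

loadings, residual = first_centroid_factor(example)
print([round(a, 3) for a in loadings])
print([[round(v, 3) for v in row] for row in residual])
```

Each column of the residual matrix sums to zero after the first extraction. Judging when the residuals have "approached the chance level" is a separate decision that this sketch does not attempt.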


APPLICATIONS TO PERSONALITY TESTS


As Cattell “® indicates, there are a restricted number of personality instruments 
in existence that have had factor analytic scrutiny of either the homogeneity or the 
independence of the component scales. It is equally noteworthy that such instru- 
ments that are available have proven to be of limited utility, impractical and relative- 
ly unpopular. To this reviewer, the difficulty seems to lie in the omission of the step 
mentioned earlier as “criterion analysis”. Although the need for external criteria
has been recognized for a long time by such empiricists as Strong for the SVIB and
Hathaway for the MMPI, the development of tests by factor analytic methods has 
not proceeded along this line. It is a little disturbing to find that Cattell, in describing
the reworking of his elaborate 16 P. F. questionnaire, employs the highly honorific
terms, “cross-validation” and “construct validity”,“® in radically altered
meaning. Working with different samples of highly selected groups of college stu- 
dents or Air Force men, Cattell claims to have cross-validated the items in his 
various scales by demonstrating that the items fall on the same dimensions in the 
factor plots of the new samples that they fell on in the original sample structure. 
The structures in the new samples were heavily determined by these very items, 
however, which makes this invariance less surprising. The more disturbing feature 
of this circular argument is that none of this material furnishes any evidence on the 
pertinence of the interpretive labels applied to the scales. Interpretation must be 
made within a body of psychological knowledge. Validity means that one can reason- 
ably make the logical substitution implied in the scale title. Cross-validation means 
that one has independent evidence about the justifiability of these inferences, free 
of the specific sampling characteristics of the original group on which the instrument 
was developed and the local idiosyncrasies of the non-test criteria. For Cattell’s tests
there is neither kind of freedom, since the criteria are not independent of the test or of the
original developmental details. Construct validity, as employed by Meehl and 
Cronbach“), implies that the test scores relate to non-test variables in a way required
of them within the nomological net of psychological theory if the scale is
validly related to the referent construct within that theory. For Cattell, however,
this use is abandoned and he has equated construct validity with internal consistency 
methods “®: »- 2%). For this well-worn notion, he seeks to substitute the new term 
“intensification”.


FACTOR STUDIES OF THE MMPI


There is a large literature now available on the attempts to re-evaluate the 
empirically derived MMPI by means of factor analytic techniques. For a great deal 
of this work the point mentioned previously about the importance of careful de- 
finition of the domain must be re-emphasized. For some of the work there is an ex- 
plicit interest in the variance common to the clinical scales. Welsh, for example, 
was trying to clarify the construct of “general maladjustment” that would lead to
positive intercorrelations amongst the separate scales. When attention is directed 
to the sources of variance reflected in some one scale, or in the MMPI item pool, 
then non-MMPI variables must be included in the matrices. An example of this 
problem is the study of O’Connor et al.“!) on the MMPI depression scale. They com- 
puted the intercorrelations of 49 of the 60 D scale items based on the replies of 300 
V. A. neuropsychiatric cases, factored the resulting matrix and reported five sources 
of variance amongst the scale responses. The authors conclude, “The results indicate 
the multidimensionality of the D scale and call into question the practice of attribut- 
ing to the scale a simple unitary significance.” This conclusion is either trivial or pre- 
mature, depending on the emphasis given to it. A mountain of evidence is now avail- 
able that the multiphasic nature of personality inventories lies at the item level. 
Every item in the MMPI can be and is interpreted differentially, by different sub- 
jects and by the same subject at different times. It is now trivial therefore that 
evidence should be offered that the items do not behave as a Guttman-type scale 
when grouped in the D scale of the MMPI. However, there is still the possibility that 
the psychological construct, depression, as employed in the psychiatric nomological
net would demonstrate the same multidimensional properties assigned to the D 
scale by these investigators. Although it does not seem likely that this would be the 
case, one cannot decide the question of the D scale’s “simple unitary significance”
without including in the factor analysis some non-MMPI variables related to the 
construct, depression. In a similar way, studies purporting to show that the MMPI 
is only two-dimensional, or three-dimensional, and not a nine-scale clinical instru- 
ment are equally misleading. The dimensionality of the MMPI pool is literally in- 
finite: any reasonably intelligent respondent could go on endlessly forming and re- 
forming classes into which MMPI items could be assembled. One of the most promis- 
ing subsets of these classes now under investigation is formed by the configural 
scoring methods of Meehl “* and of Horst °), but the dimensions to be “discovered” 
within this subset alone are probably limitless, too. 

Skillful use of factor analytic methods in development of personality test in- 
struments, in which the methods do not seem to dominate the psychological theory, 
is being shown in such different areas as the TAT-related need achievement measure 
by Clark and McClelland“, the role repertory by Levy and Dugan“), and the 
factor scales by Welsh». 

SUMMARY 

In spite of the forbidding mathematical hurdles and computational labors in- 
volved, factor analytic techniques are currently very popular and are having a sig- 
nificant impact upon research in clinical psychology. Seeking clarification of areas in 
which little is known, or desiring some statistical control over variables not directly 
under manipulative control, investigators have adopted factor analysis as a way out 
of these difficulties. Unfortunately they often have been heedless of the limitations 
of the techniques, ignorant of the assumptions underlying the methods, and un- 
critical of the burden of implicit psychological connotations carried by the factor 
analytic rationale. The nature of some of these hypotheses concerning the reality 
status, distributional forms and interrelational organization of factors and the
implications they may have on the design, execution and interpretation of
investigations were examined in some detail. The need for closer articulation with
psychological theory in the application of factor analysis was emphasized and the need to free
factor analytic methods from some of the unwarranted theoretical and psychological 
connotations they now carry was described. 


1. THE PROBLEM OF DEFINING THE CRITERION


The strategically-planned researches examined in this article lead to two test 
batteries—one for degree of neuroticism and the other for intensity of anxiety. 
However, the details of their administration, standardization and effective use will 
be relegated to a later publication “’’, though initial research forms are already avail- 
able “®: 2). Here we are concerned with the theoretical developments which made the 
tests possible and which are psychologically more important than the tests them- 
selves. 

Even when test measurement procedures are of primary interest, one can get 
nowhere without an adequate theoretical treatment of validity. And in the case of 
anxiety and neuroticism, the discussion of validity requires some fundamental think- 
ing in the logic of science. Obviously, it is necessary to reject at once such practices 
as making an “anxiety scale” by heaping all kinds of anxious and pathological
responses into a conglomerate questionnaire, or the notion that one can point to 
some single, directly-measurable, clinical criterion against which a test can be vali- 
dated, as the industrial psychologist validates against an occupational performance. 
Some may assume that a psychiatrist’s rating constitutes such a criterion. If general 
observation alone did not explode this last illusion, the recent work of Scheier“”?, 
obtaining an r of 0.25 between the anxiety ratings on 80 patients by two psychiatrists 
selected for diagnostic reputation, would suffice to do so. 

It is equally obvious that one cannot validate anxiety measures by showing that 
neurotics score more highly on them. We do not have two terms for nothing, and 
though neurotics may tend to have significantly higher anxiety, the better the corre- 
lation of the test with neuroticism, beyond a certain point, the poorer and more con- 
taminated it is as a measure of anxiety itself. And while some have thus confounded 
anxiety with neuroticism, others have allowed the definition to slide into stress, and 
similar concepts, or, in a final masterpiece of confusion, into ergic drive strength 
itself. It is our task to find a sure methodological basis for separating anxiety, neuro- 
ticism, and, peripherally, stress and fatigue. 
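The point that a test correlating too well with neuroticism must be a poorer measure of anxiety follows from the constraint that any three correlations must be mutually consistent. The sketch below computes the highest correlation a test could have with anxiety, given its correlation with neuroticism and the correlation between anxiety and neuroticism. The value of 0.5 used for the latter is a purely hypothetical figure chosen for illustration, and the function name is likewise an assumption.

```python
import math


def max_test_anxiety_r(r_test_neuroticism, r_anxiety_neuroticism):
    """Upper bound on r(test, anxiety) for a consistent set of three correlations.

    The bound is r_tn * r_an + sqrt((1 - r_tn^2) * (1 - r_an^2)), which follows
    from requiring the 3 x 3 correlation matrix to be positive semi-definite.
    """
    for r in (r_test_neuroticism, r_anxiety_neuroticism):
        if not -1.0 <= r <= 1.0:
            raise ValueError("correlations must lie between -1 and 1")
    r_tn = r_test_neuroticism
    r_an = r_anxiety_neuroticism
    return r_tn * r_an + math.sqrt((1 - r_tn ** 2) * (1 - r_an ** 2))


# Hypothetical correlation between true anxiety and true neuroticism.
R_ANXIETY_NEUROTICISM = 0.5

for r_tn in (0.5, 0.7, 0.9, 1.0):
    bound = max_test_anxiety_r(r_tn, R_ANXIETY_NEUROTICISM)
    print(f"r(test, neuroticism) = {r_tn:.1f} -> r(test, anxiety) at most {bound:.3f}")
```

Under this assumption the ceiling is 1.0 only when the test correlates with neuroticism exactly as anxiety itself does; every further gain in correlation with neuroticism lowers the best attainable correlation with anxiety.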

The multivariate experimental method, as distinct from the univariate mani- 
pulative experiment, does not start with guesses (postulates) regarding the unitar- 
iness of concepts, but shows by analysis of the covarying of phenomena what func- 
tional unities exist. Of course it still remains a semantic and experimental problem 
to decide which of the factor unities, A, B, C, etc., thus discovered are entitled to be 
interpreted and named anxiety (or anxieties), neuroticism, etc. It may be desirable 
for a time to represent the new operational entities only by symbols, or novel terms, 
thus avoiding the load of tangled misconceptions imported in such popular terms as 
“intelligence” or “anxiety”.

Sooner or later, however, it will probably prove desirable to transfer these common
terms, by a specific act of definition, to the concepts crystallized in the discovered
factors, when further verified by experiment, as having the right to inherit whatever 
core of reality exists in the common term. If this transfer is to take place, however, 
the factor must be one that is central in a very comprehensive sample of variables 
commonly semantically attached to the idea in question—in this case anxiety. In 
the series of experiments summarized here, great care has in fact been taken to in- 
clude representatives of all kinds of labelled anxious behavior according to pro- 
fessional usage, and, as it happens, a single major factor can be established in this 
area. The area is perhaps most succinctly, but comprehensively, covered by Thorne’s 
clinical evaluation in the following condensed quotation. “Fear and anxiety are
associated with widespread psychosomatic syndromes characterized by feelings of
tension and irritability, ‘band like’ frontal and occipital headaches, dryness of the
throat . . . gastro-intestinal symptoms, generalized fatigue and weakness, and
‘floating’ or fixed (phobias) apprehension”. ». %8) Conceptual validation“ of
test measurements thus becomes no longer a matter of correlating any test with one 
special criterion, e.g., an occupational success, but of correlating it with the discov- 
ered designated major factor in the agreed general area of manifestation. 

However, medicine also knows an alternative approach to defining syndrome 
entities (correlation clusters or factors), namely by clinical recognition of pathogno- 
monic symptoms, rooted in a certain etiology. Psychiatrists trained in this medical 
approach are inclined to depend on it a good deal and to argue that the search for 
unitary syndromes without knowledge of or regard for their causal factors is likely 
to be blind and confusing. Unless one knows, for example, that the patient has had a 
venereal infection, the initial onset of general paralysis would be quite difficult to 
distinguish from several other psychopathological syndromes. 

A rather profound methodological issue is involved here, for fuller discussion 
of which the reader must be referred elsewhere * in which it is maintained that 
cross sectional recognition of form has to be prior to longitudinal perception of 
etiological change—as the single, well-focussed, cross sectional ‘frames’ in a movie 
film must exist before movement of objects can be seen. In general, a pattern or en- 
tity must be definable before its appearance and disappearance can be studied. How- 
ever, the methodological issues here include some aspects of the hen-and-egg con- 
undrum, and it must be admitted that a methodologically-independent longitudinal, 
etiological basis for identification of functional unities exists. However, the latter 
requires, similarly, an application of factor analytic methods, in P-technique, in 
which situational causes and organismic effects are correlated longitudinally “?. 

Except when the basic methodological considerations just mentioned permit 
longitudinal analysis, and with the developed statistical treatment of etiology just 
described, the medically trained psychologist is in danger of depending excessively 
upon identification by etiology. He is used to diseases that have very specific—and 
known—causes, whereas the psychological response patterns are more complexly 
combined and less specific to one causal situation. A germ of measles produces a 
measles syndrome; but an anxiety response pattern can arise from countless internal 
and external sources. Parenthetically, it is familiarity with the specificity of the 
majority of physical disease entities which has made it difficult for medicine to 
recognize the cause of rheumatic fever, or to accept Selye’s concept of “non-specific
stress” disease.

It is methodologically preferable, even if not an absolute necessity, to use in 
psychology a more developed method for isolating response patterns, and to isolate 
these before considering etiology. Emphasis must be placed on recognizing and de- 
fining anxiety, neurosis and stress, by the refined and sensitive factor technique, 
directed to recognizing distinct unitary response patterns. The complete definition 
of the pattern will, of course, require supplementation by sequential, P-technique 
studies; incremental R-technique studies, as introduced below; and general etiologi- 
cal studies, centered on the recognized response pattern. 


2. THE EVIDENCE FOR A FUNCTIONALLY UNITARY ANXIETY STATE AND TRAIT

To the question whether there is one or several major factors in the area of res- 
ponses considered to be anxiety responses, the unequivocal answer of at least seven 
researches on adequate population samples: ® &- 1%. 1%. 16. 17) is a single factor in be- 
havioral tests (T-data), but several factors in questionnaire and life record (rating) 
variables, i.e., in Q- and L-data!®: ). This paradox of the one and the many can be


‘For economy of presentation we shall adhere to the practice started elsewhere: * 1%) of desig- 
nating the three media of observation: L-data (Life record, behavior in situ; by rating or time samp- 
ling); Q-data (Questionnaire, consulting room, self-report evidence); and T-data (Objective, behav- 
ioral, miniature-situational response in standard, non-questionnaire, test presentations). 
resolved neatly enough, but only with sufficient attention to technical issues, as
given below. 

Let us first look at the single factor in objective behavioral tests, which thus 
strikingly confirms the clinician’s unitary concept. It has been symbolized as U. I. 
(T) 24 in the proposed universal index series for personality and ability factors. In 
Table 1, the loadings for this factor as found invariant (save for a due range of ran- 
dom error) in the two adult and two child populations collated earlier“, and the 
Rosenthal °*) and Scheier“?) studies since, are set out. The table follows the usual 
conventions in defining patterns “ee * 7. 8. nd 13)) namely, salients common to 
studies first, followed by salients found in single studies only; reversal of verbal labels 
for tests where signs are negative; and identification of experimental test by a master 
index number through which the actual test can be obtained® for further ex- 
periment. 


TABLE 1. LOADING PATTERN OF U. I. (T) 24: GENERAL ANXIETY





Manifestation (Variable): Master Index Number; Title, with Present Direction
Factor: Series Identification, Sample Size and Research

Series identification: 9(- 12 5 1 1
Sample size: 500 250 184 100 70 80
Research: (8) (8) (16) (2) (28) (17)

Common to Several Studies
High Annoyability
Willingness to confess frailties
Tendency to agree
Absence of questionable preferences
High ratio hard-headed to sentimental-indulgent attitudes
High emotionality of comment
Low confidence in untried performance
Good grades (intelligence out)
High verbal ability
Critical severity

52
-42
24
33

28

39 021
-19 -43
49

09

48

49 10
-38 -36
26 18

33

35 17
-20

18

19

Found in one Research
Low hand steadiness (tremor)
Lesser saliva volume
Good ethical choices
Logical inconsistency of responses
Higher ratio acquaintances to friends
Low “contact with reality”
Slower tempo
Greater sex erg tension by objective attitude tests
Less increase of heart rate after startle

Questionnaire and Rating Variables
16 P. F. Q4 Ergic Tension
16 P. F. O Anxious-depression
16 P. F. Q3(-) Low Will (Self Sentiment)
16 P. F. C(-) Low Ego Strength
16 P. F. L Protension (Paranoid)
16 P. F. M Autia
16 P. F. H(-) Threctia
Checklist of Anxiety Symptoms
Anxiety rating by psychiatrist*

47
33
60
-20
25
18 18
-36 -18
56 57
39

*Mean of 2 psychiatrists. Only this factor was significantly loaded in the ratings of both
psychiatrists.


The identification of this sufficiently replicated pattern with the concept of 
general anxiety is made, first, by virtue of its containing the tests of such diverse 
anxiety manifestations as irritability, emotionality and tension, lack of confidence, 
self criticism, tremor, reduced salivary secretion, etc. The significant loading in 
“less increase of heart rate by startle” is presumably due to the anxious person having
already, like Macbeth, “supped full of horrors”. Secondly, it is made by reason
of the very large loadings it contains in those questionnaire factors that have anxiety
content; in the anxiety symptom check list; and in the psychiatrist’s rating. In both 
cases the logical proof requires also that these do not appear together in any other of 
the twelve to eighteen personality factors“*’, and this condition, we find, is also met. 

A number of important and decisive inferences can be made from the special 
combinations of behavior found in this pattern, notably the combination of tension 
loadings with good school grades and good ethical choices; but space compels us to 
transfer the discussion elsewhere “*) and to attain here simply the demonstration of 
a consistently replicated, complex pattern. However, before leaving this pattern, to 
deal with its further relations— to Q- and L- data patterns, and to neuroticism— 
the theoretically very important fact should be mentioned that it has been estab- 
lished not only as a pattern in individual differences (R-technique) but also as a 
pattern of coordinated change within the individual (by incremental R-technique 
and P-technique). In other words it is a dimension defining a state as well
as a trait. Indeed the individual differences in trait may be nothing more than rela- 
tively permanent differences in state“ level. 

The pattern found for the incremental R-technique (factoring of differences be- 
tween two anxiety stimulus levels) is set out elsewhere “”) and the match is very good. 
It adds: higher systolic blood pressure, distractibility, and fluency in self criticism. 
Parenthetically the reader should note that when we say factors in independent 
studies “match” we do not mean merely by impression, and inspection of a table,
but that: (1) the two independent simple structures, obtained without rotation
prejudiced by any theories, have been checked by Bargmann’s test for significance,
and by the secondary criterion that at least 60% of variables must fall in the hyper- 
planes, and, (2) the pattern agreement of two researches has been checked on actual 
common marker loadings by the Cattell-Baggaley “s” index for significance of factor
similarity“. 
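The secondary criterion mentioned above, that at least 60% of variables fall in the hyperplane, amounts to a simple count. The sketch below assumes that a variable lies in a factor's hyperplane when its loading is within ±0.10 of zero; that width, the example loadings, and the function names are assumptions made for illustration. Bargmann's test and the “s” index are not reproduced here.

```python
def hyperplane_fraction(loadings, width=0.10):
    """Fraction of variables whose loading on a factor is within +/- width of zero."""
    if not loadings:
        raise ValueError("at least one loading is required")
    in_hyperplane = sum(1 for a in loadings if abs(a) <= width)
    return in_hyperplane / len(loadings)


def meets_hyperplane_criterion(loadings, width=0.10, minimum=0.60):
    """True when the hyperplane count reaches the required proportion."""
    return hyperplane_fraction(loadings, width) >= minimum


# Hypothetical loadings of ten variables on one rotated factor.
example_loadings = [0.52, -0.42, 0.05, -0.03, 0.08, 0.00, 0.09, -0.07, 0.33, 0.02]

print(hyperplane_fraction(example_loadings))
print(meets_hyperplane_criterion(example_loadings))
```

In this hypothetical factor seven of the ten variables have near-zero loadings, so the criterion is met; a factor with most of its loadings well away from zero would fail it.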

In the completeness of this demonstration of the functional unity of the anxiety 
pattern, however, one lacuna still exists. The riddle remains that we have not yet 
been able indubitably to recognize the factor in P-technique studies“. This is large- 
ly due to an accidental deficiency of the necessary markers for identification in the 
studies concerned. The resultant uncertainty consists in that the U. I. (T) 24 match 
could be either with U. I. P. F. 9 (—), previously labelled General Autonomic Activ- 
ity, or with U. I. P. F. 7 (—) the Basophil-neutrophil ratio pattern“. The former is 
more probable, and makes better sense theoretically in that it contains the chief phy- 
siological variables of autonomic over-activity; but an important study remains to 
be carried out here to give that dependable identification of the P-technique pattern 
which is now needed for theoretical advance. 


3. THE EVIDENCE FOR THE DISTINCTNESS OF THE NEUROTICISM DIMENSION


Now the especial value of the above location of the “general anxiety factor” in
objective (and physiological) measures— as distinct from the identification in the 
latter patterns (below) occurring in Q-data only— is that it permits a clear dis- 
tinction of anxiety and neuroticism. For a factor has been found in the same T-data 
series, simultaneously, in several studies, indexed as U. I. (T) 23, which there is good 
reason to identify with neuroticism. 

First, as to the sufficiency of confirmation of the U. I. (T) 23 pattern in itself: as 
early as 1955 it had been verified “ in seven independent studies— five in our labora- 
tory and two in Eysenck’s— with groups of considerable diversity: adults, children, 
psychotics and neurotics (See Table 2). Blind simple structure rotations have produced
it three more times since 1955 (5, 17, 28) and one of these researches was an
incremental R-technique, showing that over a two month interval people rise or fall, 
slightly, simultaneously, on the key variables on the pattern (Column 1, Table 3). 
Conceivably we are again dealing with trait (individual) differences that are largely 
state (occasion) differences, but, compared with anxiety, the function fluctuation is 
apparently less and over longer periods. 

The proof that this dimension is distinct from anxiety is twofold: (1) The content,
i.e., saliently loaded variables, is quite different, and (2) in practically every
research in which U. I. (T) 24 has appeared, U. I. (T) 23 has also been found at the 
same time. Therefore, U. I. (T) 23 could not be U. I. (T) 24 reappearing in a different 
pattern of unrelated variables in a different research. Indeed, in the simple structure 
plots of 23 and 24, it is possible to see their hyperplanes very distinctly and to de- 
termine their relation (angle of hyperplanes) with considerable exactness. (They 
prove to be slightly but significantly positively correlated.) 

Compared with the demonstration of unitariness, and distinctness from anxiety,
the proof that U. I. (T) 23 also corresponds to the clinical conception of neuroticism en- 
counters some complications. The initial proof that the variables in U. I. (T) 23 
distinguish neurotics from normals, better than any other known tests, we owe to 
Eysenck, Himmelweit, Furneaux and others of the Maudsley group. But the
present writer cannot accept that Eysenck’s principle of criterion rotation is a self- 
sufficient principle, and, indeed, it is subject to the same criticism as the obscure 
practice of validating anxiety scales in terms of differences of neurotics and normals, 
mentioned above. In criterion rotation, it is true, the crass assumption is not made 
that the criterion is simply “anything distinguishing neurotics from normals’’, for 
the extra condition is required that one be able to produce a factor pattern corres- 
ponding exactly (except for chance error) to the difference pattern. Nevertheless, 
with present techniques, if neurotics differed from normals both in neuroticism and in 
anxiety level, a complex factor confounding the two might be mistaken for “neu- 
roticism’’. 

As it happens, the certainty (statistical significance) with which simple structure
has been obtained in the five researches in our laboratory, and the simultaneous in- 
dependent demonstration we have made of the anxiety factor, show that Eysenck’s 
criterion rotations did not happen to confuse two factors. U. I. (T) 23 is not U. I. (T)
24, and it remains the factor most powerfully distinguishing neurotics from normals. 
But is it exactly what the clinician calls neuroticism? 


Since the largest study was on severe and hospitalized neurotics, the contaminating
dimension next most open to suspicion would be psychoticism. Eysenck,
however, has always been very alert to this possibility, and has compared neurotic 
and psychotic groups, with results favorable to the theory that neuroticism and 
psychoticism are entirely distinct directions of maladjustment. Our own results
indicate a factor U. I. (T) 25 apparently identical with the pattern he obtains dis- 
tinguishing psychotics from normals. It is not, as yet, replicated exactly enough, on a 
sufficient number of variables, in a sufficient number of studies, to stand alongside 
U. I. (T) 23 and 24 in quality of definition, and reliability of pattern measurement. 
But it is clear enough to show that we are encountering a dimension of behavior dis- 
tinct from neuroticism and anxiety, and peculiarly distinctive of psychotic groups 
when contrasted with neurotics and normals. 


Thus if the separation of groups designated neurotic, psychotic and normal is the 
pay-off, U. I. (T) 23 is neuroticism, and, for the rest, cynics may ask ‘What and
where is this agreed concept, among clinicians, of neuroticism, which U. I. (T) 23 
could, perhaps more exactly, depict?’ The consensus on neuroticism definitions is 
indeed a very limited one; but perhaps it covers “excessive conflict of ego, id and
super ego, without solution by either successful repression or stable expression.”
The doubt which caused us to call U. I. (T) 23 “Low Neural Reserves vs.
Ego Strength”, instead of “Neuroticism”, arose from the fact that signs of conflict
are not particularly conspicuous in the variables, but, instead, there is chronic 
fatigue, lowered coordination, and inability to cope with behavioral problems— 
which might be the consequence of prolonged conflict, rather than the conflict itself, 
or even the result of physical illness. As Thorne points out, a pattern characterized 
by exhaustion, mixed symptoms, overwroughtness and inability to concentrate on 
immediate stimuli has been clinically recognized, e.g., in Spencer Cowles’ “acute
neural exhaustion state”, representing an organismic failure to achieve integration
under stress as in the classical “neurasthenia”.²

It is this generalized breakdown which is almost undoubtedly the clinical ex- 
pression of extreme U. I. (T) 23 measures. The particular form of the neurosis— the 
specific pathognomonic signs and relationships— will depend on the specific dynamic 
conflicts, and on those temperamental and general personality factors, e.g., surgency- 
desurgency (F), cyclothymia-schizothymia (A), tendency to anxious depression (O), 
which express the essential personality as it persists through the neurotic or psychotic
translation along factors U. I. (T) 23 and 25. It has been with the intention of
retaining the meaning for U. I. (T) 23 (-) as a more generic “exhaustion vs. immediate
capacity to mobilize” that we have generally avoided the designation ‘neurosis’
and used “lowered neural, hypothalamic reserves (incapacity to mobilize) vs.
ego strength”.


TABLE 2. THE LOADING PATTERN OF U. I. (T) 23 (-), NEUROTICISM VS. EGO STRENGTH

Factor: Series Identification, Sample Size and Research
  Factor number in series:  10  8  7  7  5(-
  Sample size:              370a  370a  100a  100a  184c  105a  98a  80
  Research:                 (See (6))  (6)  (6)  (6)  (16)  (20)  (21)  (17)

Master Index   Manifestation (Variable): Title, With                 Loadings
Number         Direction in Present Factors

Objective tests: Common to Several Studies
2              High motor rigidity                                   5  2  58  20  37
41             Poor two hand coordination                            2  -20  (15)  -37  -36
42             High body sway (and suggestibility)                   2
120            High ratio of inaccuracy to speed                     20  2  36
28, 223        Low “dynamic momentum” (endurance of difficulty)      0  12
151            Slower decision on particulars than principles
32, 1          Larger P. G. R. to mental than physical threats
101, 148       Much excess of aspiration over achievement            -31  -27
162            High ratio of shock to normal performance
1              Low fluency
167            Poor immediate memory
216            High suggestibility about shock
51             Low carefulness of observation

Objective tests: Found Essentially in One Research
               Much mind wandering (Inability to concentrate)
               Few figures added correctly
               More distraction (by esthetic temptation)
               Less spread than localized effort
               Less renunciation
               Less inhibition from failure
               Low metabolic rate change with stimulation            -57
               Low salivary output
               High P. G. R. absolute resistance
               More pupil dilation in stress
               Avoidance of stronger smells

Ratings, Questionnaires
               16 P. F. Low Ego Strength C (-)                       21
               16 P. F. Anxious Depression O                         23
               16 P. F. Lack of Will Control Q3 (-)                  26

Parenthetically, the reader familiar with the standard factor representation will
recognize that U. I. (T) 23 is presented here in the direction opposite to the usual
positive one, since U. I. (T) 23 (-) is the “neuroticism or ego weakness” direction, with
which we are here concerned. (Experimenters should thus be alert to the changed
direction of loading and scoring from the standard U. I. (T) 23 presentation.) The
central feature in the two dozen varied performances above is surely a form of fatigue
(rigidity has been shown to increase with fatigue), a distractibility and a poor capacity
to mobilize appropriate immediate responses (which, by the factoring out, is
proved to be distinct from low intelligence). Premature crystallization of an unduly
specific hypothesis about this incapacity is perhaps best avoided by the label “low
neural reserves” or “debility”, defining an effect on all kinds of central nervous system
performance: cortical, cerebellar and hypothalamic. Evidence from P-technique
and incremental R-technique will show that though this can be labelled a kind of
fatigue, it is of longer duration and different in nature from ordinary diurnal fatigue,
which is demonstrably a statistically distinct factor, though lateness of day also
contributes, slightly, positively, to this “over-wroughtness” factor (see below).

²Personal communication.

THE CONCEPTUAL AND TEST DISTINCTION OF NEUROTICISM AND ANXIETY 227

To complete this evidence of functional unity of U. I. (T) 23, the factor has been
demonstrated by Scheier in analysis of increment scores on the same markers
(with students, respectively well rested and after hard work, immediately before
examinations) and independently by Cattell and Haverland in P-technique as a
factor initially labelled “Overwroughtness vs. Untapped Reserves” (P. U. I. 8). The
alignment of these patterns is shown in Table 3. It is not yet as good as could be
desired, the pattern similarity significance centering on P = .03 for the three
matches. However, as stated, it is at any rate important that the pattern is clearly
distinguishable from the diurnal fatigue factor in the same series. This study shows
that the pattern elements found in the factoring of individual differences (high
rigidity, sway suggestibility, low fluency, low endurance, low effectiveness of
concentration in calculation, etc.) also vary together in one person
with time and circumstance. Further, it adds that the “neurotic” pole is also
connected, nevertheless, with social overactivity and excitability.

TABLE 3. GENERAL NEUROTICISM (U. I. (T) 23 (-)) VS. EGO STRENGTH AS A STATE FACTOR

Factor: Series Identification, Sample Size and Research
  Factor number in series:  1  10(-  7
  Technique:                Incremental  P-tech.  P-tech.
  Sample size:              80 cases  1 case  1 case
  Research:                 (17)  (See (7))  (24)

Manifestation: Title, With Direction in Present Factor      Loadings
Higher disposition rigidity                                 32  54  17
Slower reaction time, especially unwarned                   11  44  14
Smaller myokinesis                                          -24  -44
Lower fluency                                               07  -38
Lower endurance                                             -29
Reduced hand steadiness                                     -11  -42
Short hours of sleep previous night                         51  26
Jokes rated unduly funny                                    55
Small movements in myokinesis                               44
Lower ratio accuracy to speed                               -38
Higher sway suggestibility                                  10
Lower ego strength; 16 P. F. Factor C (-)                   -31
Greater id tension; 16 P. F. Factor Q4                      16
Hectic activity (obs. rating)
More acceleration by competition                            24
More response error in complex reaction times               20
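The notion of a state factor, in which markers rise and fall together from one occasion to the next, can be illustrated with a minimal sketch. The scores below are hypothetical and are not data from the studies cited; `pearson` and `increments` are illustrative helper names. Correlating change scores in this way is only the first step of an incremental analysis, which would go on to factor the full matrix of such correlations.

```python
from math import sqrt


def pearson(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def increments(first, second):
    """Change score for each person: second occasion minus first."""
    return [b - a for a, b in zip(first, second)]


# Hypothetical scores for five persons on two occasions (not data from the paper).
rigidity_t1 = [10, 12, 9, 14, 11]
rigidity_t2 = [12, 12, 8, 17, 10]
fluency_t1 = [30, 28, 33, 25, 31]
fluency_t2 = [28, 28, 34, 23, 33]

# Correlation between the two sets of change scores across persons.
r_change = pearson(increments(rigidity_t1, rigidity_t2),
                   increments(fluency_t1, fluency_t2))
print(round(r_change, 2))
```

With these invented figures the change scores correlate negatively: persons whose rigidity rose tended to lose fluency over the same interval, which is the kind of covariation a state pattern requires.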

The essential nature of U. I. (T) 23 (—) might, in summary, be designated as an 
inability to coordinate and control responses of almost all kinds in immediate activi- 
ties. One is strongly tempted to call it “lack of integration”, but this would invite 
confusion, for there is no evidence of lack of true dynamic integration, i.e., of harm- 
ony of goals and self-sentiment values, but only of inability to effect immediate 
integration in muscular control, psychological impulse, memory, the overcoming of 
fatigue, etc. Indeed, all the evidence points to this other kind of lack, i.e., of dynamic
integration, being synonymous rather with U. I. (T) 24, Anxiety, as seen below. The
word immediate is therefore italicized to emphasize that the “lack of integration” is
not of a wider character, visible in purposes and values, but simply represents inability
to control immediate motor activity, thoughts, productivity and impulses, in
mobilization toward an immediately accepted goal. Now this may well be the correct
meaning of “ego strength”, in Freud’s description, though some clinicians have
allowed the meaning to drift into “general dynamic integration”.

In favor of this identification of U. I. (T) 23 (-) with “debility vs. ego strength”
is the fact that its only consistent correlation with a questionnaire factor has been
with 16 P. F. factor C (-), independently denoted, earlier, as Ego Strength. Doubt
about complete identity, however, arises from the facts: (1) that the correlation has
not been as high as would be expected, with attenuation correction, for a complete
identity, (2) that this correlation might really be no higher than would be expected as
part of the correlation of U. I. (T) 23 (-) with U. I. (T) 24, which shows itself, secondarily,
as a series of quite low correlations of U. I. (T) 23 (-) with the questionnaire
factors Q4, Q3 (-), O and C (-), which highly load U. I. (T) 24 (See Table 1), and (3)
that questionnaire factor I (Premsia or Sensitivity) correlates as highly with U. I.
(T) 23 (-) as does C (-), in certain samples. Granting this much uncertainty, the
favored hypothesis is still that C and U. I. (T) 23 are identical, and alike appropriately
interpreted as Ego Strength vs. Debility (or Generalized Neuroticism or
Impaired Immediate Ability to Mobilize).
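The attenuation correction mentioned in point (1) can be sketched as follows. The text does not say which formula was applied; this sketch assumes the usual Spearman correction, in which an observed correlation is divided by the square root of the product of the two reliabilities. The function name and all numerical values are hypothetical.

```python
from math import sqrt


def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Spearman's correction: estimated correlation between the true scores."""
    if not (0 < r_xx <= 1 and 0 < r_yy <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / sqrt(r_xx * r_yy)


# Hypothetical values, not taken from the studies cited.
observed_r = 0.30           # observed correlation of battery with questionnaire scale
reliability_battery = 0.60  # assumed reliability of the objective test battery
reliability_scale = 0.70    # assumed reliability of the questionnaire scale

corrected_r = correct_for_attenuation(observed_r, reliability_battery, reliability_scale)
print(round(corrected_r, 2))
```

Under these assumed figures the corrected value remains far below unity, which is the form the doubt in point (1) takes: unreliability alone would not account for the shortfall from complete identity.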

As stated above, this neurotic debility is not to be considered as the specifically 
recognized neurotic behavior per se, which is determined partly by the specific pre- 
cipitating environmental stress and the pre-neurotic personality. It consists instead 
of the essential ego weakness, visible in the multiplication of defense mechanisms, 
in instability, in a sense of inadequacy, and in a preponderance of dissatisfied emo- 
tionality and impulsiveness, as brought out most clearly in the L-data (life record) 
factor pattern for C (-). The first alternative to explore, if the hypothesis of
identity of C and U. I. (T) 23 fails, is that U. I. (T) 23 (—) is merely a more restricted 
“exhaustion” factor arising from C (—); but this is less likely. 


4. ANXIETY ORIGINS AND EXPRESSIONS SHOWN IN THE SECOND ORDER FACTOR
Returning now to the anxiety factor, we have to investigate the paradox that
though there is one U. I. (T) 24, there are several personality factors, with discernible
anxiety content, in L- and Q-data. Fortunately, we are granted a rapid denouement.
The recent, careful, second-order factor analysis over the primary factors in
personality shows that both in L-data and Q-data, these particular primaries,
covering diverse aspects of such behavior, come neatly together in a second-order
factor, which has been called Anxiety vs. Dynamic Integration. In diminishing order
of loading (combining Table 4 and the L-data results) the factors are: Q3 (-), Lack
of will control or defective self sentiment formation; Q4, Ergic tension (Id pressure);
O, Anxious depression and guilt; C (-), Ego Weakness; L, Protension or paranoid
trend; and H (-), Threctia (or susceptibility of sympathetic system to excessive
responsiveness to threat), together with one or possibly two less constant factors shown
in L-data but not in the Q-data of Table 4. Certain conglomerate “anxiety
scales” and symptom check lists succeed in falling largely in this second-order
factor, and are evidently varying mixtures of the primary personality factors just
defined. Inspection of the original rating and questionnaire items will show
that these primaries cover all primaries that could conceivably be interpreted as
anxiety, except possibly primaries I (Sensitivity, or Premsia) and F (-) (Desurgency),
whose exception is discussed elsewhere.

TABLE 4. THE SECOND ORDER ANXIETY FACTOR APPEARING AMONG PRIMARIES IN Q-DATA

Symbol    Factor I: Anxiety vs. Dynamic Integration                   Loading
Q3 (-)    Lack of Will Control* or Poor Self Sentiment Integration    .66
O         Anxious-depression or Super Ego Susceptibility              .62
Q4        Ergic Tension or Id Pressure                                .62
L         Paranoid Trend (Protension)                                 .54
C (-)     Lack of Ego Strength                                        .53

*The titles used are those of the 16 P. F. Handbook. To make easy, consistent reading all
loadings have been made positive, by reflection, in these factor lists. The reflection is indicated by the
sign on the left and the change of title to the opposite pole, e.g., “Lack of...”
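To give a rough sense of how strongly each primary is bound into the second-order factor, the Table 4 loadings can be squared. This is a minimal sketch under two assumptions not stated in the text: that the loadings may be read as correlations of each primary with the factor, and that the first loading is .66 (its decimal point is missing in print). The symbols are matched to the titles from the surrounding discussion.

```python
# Loadings from Table 4; symbols matched to titles from the text.
# Assumption: the first loading is read as .66 (decimal point lost in print).
loadings = {
    "Q3(-)": 0.66,  # Lack of will control
    "O": 0.62,      # Anxious depression
    "Q4": 0.62,     # Ergic tension
    "L": 0.54,      # Paranoid trend
    "C(-)": 0.53,   # Lack of ego strength
}

# Squared loading: proportion of a primary's variance shared with the factor,
# if the loading is treated as a correlation.
shared_variance = {name: round(a * a, 4) for name, a in loadings.items()}
mean_shared = sum(a * a for a in loadings.values()) / len(loadings)

for name, share in shared_variance.items():
    print(f"{name:6s} {share:.2f}")
print(f"mean   {mean_shared:.2f}")
```

On this reading no primary shares as much as half its variance with the second-order factor, which is consistent with treating the primaries as distinct components rather than as alternative measures of one thing.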

Although the second-order structure, analysed on three independent populations,
is very convincing, the evidence for such a second-order factor in Q- and
L-data, and for its identity with the first order objective test anxiety factor U. I.
(T) 24, does not rest on factor analysis alone. For, additionally, it has been found
that: (1) These same six factors all show very similar change scores with age, being
high in adolescence and dropping to early maturity (differing in this from other primary
factors). (2) In all studies (5, 17, 28) where objective tests and questionnaire
factors have been measured together, the above six correlate directly and very substantially
with U. I. (T) 24 and only with U. I. (T) 24, except for C, as stated above.
They do so with the same sign and magnitude as would be expected from the identification
of the second order factor with U. I. (T) 24 (See Table 1). This includes both
absolute and incremental factorings. (3) Important sub-tests of U. I. (T) 24 and
questionnaire responses have been shown to change simultaneously, as their unity
requires, in students tested at and remote from examinations, and in air pilots before
and after threatening flight experiences. (4) These same six factors have an identical
pattern of correlation with other major entities (Tryon’s criterion of functional
unity), notably with measures of interest in career, in self sentiment and in need
for security.

Granted this demonstration of a functional unity among certain primary factors,
assessed from either L- or Q-data, at the second order level, and its identification
with U. I. (T) 24 in T-data, how do we explain (a) the non-appearance of six corresponding
primary patterns directly in T-data, and (b) the nature of the relation of
the six Q- and L-primaries to the unitary second-order (or T-data first order) factor, which
is, after all, the essential anxiety unity running across Q-, L- and T-data?

One explanation why the primaries do not show up in T-data (and that therein 
we pass instead directly to the second-order unity) could be that the T-data ob- 
servations are concerned with very general behavioral and with physiological res- 
ponses, not embedded in the more specialized and particularized ramifications of life 
purposes, attitudes and self concept values capable of yielding the more refined 
structure found in L- and Q- data. The change from a single factor to a six factor 
group would thus be explained analogously to the transition from a lower to a 
higher power on a microscope, resolving a single blob of bacteria into several mem- 
bers, but this description still leaves us in need of a hypothesis to account for the 
primaries adhering together in this way. Confirmed, empirical, second-order factors 
are still too few to have yielded any generalization about the usual causal dependence
in such cases; but we know from statistical logic that the elements loaded in a
factor can be either causes or consequences of the factor itself. We also know that 
probability is on the side of their being consequences, since it is hard to imagine a 
lot of essentially independent causes being as highly correlated as would be necessary 
for them simultaneously to load one factor. However, the only sure answer is to be 
obtained through supplementing the factor analysis by controlled experiment. In the
following hypotheses to explain the origins and consequences of anxiety, developed
more specifically by present quantitative findings from hypotheses put forward seven
years ago on general psychoanalytic and clinical evidence, some loadings of primaries
in U. I. (T) 24 are made causes and some effects, according to ancillary indications.


1. The hypothesis that anxiety will be proportional to id pressure seems to be borne out
by the Q4 component, one of the largest of the six. However, our present knowledge of Q4 is in its
infancy, to the extent that it may have to be considered total ergic tension rather than repressed
ergic tension only. This still does not identify anxiety with “general drive”; but says that only
one of the components in anxiety is total undischarged ergic tension. High total ergic tension, from
whichever of the standard causes, could produce anxiety directly, by threat to the ego, i.e., to
the master need to maintain way or because to have more needs in the environment is to have
proportionately more danger of loss, frustration, suppression and repression. This hypothesis
therefore states that anxiety is a derivative of ergic tension, and not that all ergic tension is
transformed into anxiety or that all anxiety is ergic tension.

2. As hypothesized, ego weakness is associated with anxiety, as shown by the consistent
C (-) and U. I. (T) 23 (-) correlations. But this may either confirm the more specific
hypothesis that fear of overthrow of the ego generates anxiety, or the converse causation that
high anxiety (especially in the developmental period) encourages ergic regression and repression,
and, therefore, impoverishment of ego development. Or both causal directions could operate, as
far as this present evidence of association goes.

3. The powerful role discovered for the guilt-depression factor, O, may or may not support
the hypothesis, central, in different ways, in the anxiety theories of Freud and of Mowrer,
that super ego threat is responsible for much anxiety. There seems little doubt, from the content
of O (emphasis on fear, guilt, inadequacy, dependency, and high standards) and from its appearance
in extreme form, as a psychotic “guilt” factor, that it represents either a super ego development
or, more likely, a dependence which makes the individual prone to marked super ego development.
Until this is cleared up, and the relation to the “effective” super ego pattern G (in
the 16 P. F. factors) clarified, we can speak only of a special “depressive anxiety” component
(distinguishable from simple depression or desurgency, F (-), which is not involved in U. I. (T) 24).

4. No previous hypothesis accounts for the loading in factor L, protension or paranoid 
trend. L’s description by content, and its effect in group dynamics, has been: inner tension,
suspicion and a special tendency to the projective (in the true psychoanalytic sense) defenses.
The false, dangerous, paranoid relation to society may generate anxiety (through partial insight)
or, more likely, anxiety, as a social insecurity of unspecified origin, may cause either (a) paranoid,
defensive over-self-evaluation or (b) cognitive deterioration through prolonged anxiety, in accordance
with physiological theories of paranoia.

5. The excessive preoccupation of many clinical psychologists with elaborate theories per se
ea accounts for neglect of the obvious quantitative hypothesis that if a capacity to be apprehensive
of future dangers is part of anxiety then the individual’s anxiety level will tend to be proportional
to (a) the situational dangers and (b) the constitutional strength of his proneness to
timidity! It may be, however, that timidity of disposition is not the same as level of fear erg
tension. Indeed, when ergic tension pane th have been measured along with the present variables,
as in the work of Rosenthal, Scheier and Wenig, no significant positive relation
has been found between fear of real dangers (ergic tension of fear in relation to standard dangers)
and anxiety. The only ergic tension consistently found substantially correlated with anxiety has
been that of the sex drive (5, 17, 28), fitting Freud’s theory that sex is paramount among the
undischarged drives (our Q4 component); and possibly, also, his theory of “the psychosexual
constitution.”

In spite of this lack of relation of anxiety to the fear erg, found as a dynamic, ‘ergic tension’
factor, we find a very definite relation (Table 1) to timidity as a temperamental factor, H (-).
Dimension H was initially, descriptively, called Boldness vs. Shyness (and is probably represented
also by Guilford’s S factor, according to an r of 0.85 recently obtained between H and S), but
in view of increasing evidence of autonomic associations, specifically of parasympathetic blocking
of sympathetic fear response, in H+, it has been renamed, by hypothesis, Parmia vs. Threctia.
The threctic pole, H (-), defining “ready autonomic responsiveness to threat”, correlates 0.33,
0.18 and 0.36 (in samples of 500, 80 and 80 respectively) with anxiety, indicating that a disposition
to timidity (for H is highly inherited) is not a negligible consideration in the genesis of
anxiety. It is probable that an individual's scores on both H (-) and O represent a product of his
proneness to situational anxiety and the strength of threat at the time, whereas C, L and Q3 (-)
would be expected to have little relation to the situational threat. The H (-) loading thus
supports our “situational timidity” theory.

The most highly loaded primary in the second-order factor (Table 4) is Q3 (-),
and it rivals Q4 for highest mean loading on (correlation with) U. I. (T) 24 in Table 1. Corrected for
attenuation these loadings could indicate that Q3 (-) is virtually the essence of the second order
factor. Q3 has been interpreted, through its content and its many criterion associations, as a
will control factor representing the organized self sentiment in action. That is to say, it represents 
the strength with which the individual strives for behavior integrated according to a consciously 
approved (but not unrealistically idealistic) self concept. 

That Q3 should, factor analytically, be the cornerstone among the six primaries supporting
the general anxiety factor, is of great psychological interest. It suggests that the effectiveness of
the Q3 striving for total integration is limited by the other conditions (degree of ego weakness,
C (-); super ego pressure, O; total ergic tension demands, Q4; etc.) and that anxiety is nothing less
than the degree of failure in the final dynamic integration. As dirt is matter out of place, so
anxiety is ergic tension out of control. Or, to refine the simile, anxiety represents the extent of the
threat to the organism's effective dynamic expression as an integrated system, in running equilibrium.
If so, the true opposites in our conceptions should be anxiety vs. good dynamic integration, not
neuroticism vs. integration. Neuroticism, U. I. (T) 23 (-), must therefore be conceived not as a
lack of dynamic integration, but only as that special contribution to mal-integration which arises
from ergic regression followed by defective build up of ego strength. The resulting chronic,
systematic failures to face ergic and super ego demands, accompanied by constant anxiety of ego
overthrow, constitute the specifically neurotic contribution to anxiety.

In so far as it is necessary to relate these findings to current psychiatric divisions
of anxiety into “free” and “bound” anxiety, and into central (intrinsic or inherent)
and peripheral (or situational) anxiety, it should be noted that in all except
factors H (-) and Q4 we are dealing with anxiety inherent in the system, a part of
the make-up of the personality. The strengths of factors H (-) and Q4, by contrast,
represent in each case a function of interaction of the characteristics of the individual
and the situation. H (-) represents the degrees of arousal of apprehension, as a
product of the individual’s timidity and the strength of environmental threat; Q4
represents the total ergic excitation, as a product of the individual’s excitability and
the level of stimulation in the environment. By the nature of factor analysis these
expressions in the individual of combined susceptibility and stimulus intensity appear
in present techniques as single factors, though there are ways of fissioning
them into organismic and environmental factor components if these exist. The
fundamental equation for an ergic tension factor score, breaking it into
situational and other components, applies to H (-) and Q4 here as to any other ergic
tension. A calculation from Table 4 suggests that the two dispositional-situational
sources, H (-) and Q4, contribute little more than 1/3 to the total individual
difference variance in anxiety; but this must be rough until the second order
loadings are more exactly evaluated and the situational terms in the equation
determined.

As to the psychiatric concepts of free and bound anxiety, there exist two factor 
dimensions to which they could be attached. Probably the obsession-compulsion 
neurosis, with which the idea has been principally associated, is not common enough, 
in those “normal” population samples in which factors have been established, to
yield a factor of large variance, and probably K (-) and M (-), both of which have
low loadings in the second order factor, best represent this obsessional ingredient.
But, granting that binding is defined to mean also socially apt and useful habit discharge
of “anxiety”, then Q3, with all its emphasis on socially approved behavior and
self control, is the major measure of the individual’s success in binding anxieties
through effectively stabilized habits. Since Q3, K and M are all negatively loaded in
the major second order factor, this makes the anxiety factor (and U. I. (T) 24) in
our discussions synonymous with free anxiety. Indeed, by the structure shown here,
bound anxiety, i.e., successful, stable, habit expression, is the very antithesis of
anxiety, for it is the integrated energy discharged in a dynamic equilibrium through
Q3. This suggests that clinical conceptions would be clearer if “drive” were substituted
for the expression “bound anxiety”. For though drive may at some phase have
expressed itself as anxiety, when bound it is certainly no longer anxiety, and no more 
entitled to be called anxiety than to be called any other drive, e.g., sex, aggression, 
from which it may have been derived before passing into the bound state. 


SUMMARY

1. A general factor of anxiety, U. I. (T) 24, is found, at the first order level,
on objective measures of anxious behavior of all kinds and as a second-order factor
among questionnaire (Q-data) and life-record-in-situ (L-data) primary personality
factors.

2. A factor of general neuroticism, or better, neurotic debility vs. ego strength, 
U. I. (T) 23 (-), has repeatedly been found to distinguish neurotics from normals, and
to be distinct from the anxiety factor (though slightly, significantly correlated with 
it in most populations). 

3. Both the anxiety factor and the neurotic debility factor have been demonstrated
as patterns in states, as well as traits, i.e., as unitary factors in incremental
measures from day to day.


4. The L- and Q-data primary personality factors forming the second order 
anxiety factor, are Q3 (-), Q4, O, C (-), L, H (-) and possibly K (-). A theory explaining
these as causes and consequences of anxiety has been put forward through the
checking of these findings with earlier hypotheses.

As a result of the above, batteries for the measurement of anxiety and neuroticism
have become available to the clinician. In objective, behavioral tests, essentially
ten sub-tests are available for each battery. All twenty can be, and three have
to be, individually administered. Six to ten sub-tests constitute an absolute minimum
for reliable measurement of each factor with present knowledge as to available
high-loaded tests. Research needs urgently to be directed to evolving tests with
higher correlations with U. I. (T) 23 and 24. It is hoped that the current work of
Scheier on the anxiety factor will result in the discovery by 1958 of such a battery,
exceeding present saturations, and still of no more than one hour’s duration. According
to the method of estimate proposed elsewhere, the existing 1955 batteries for
neuroticism and anxiety have construct validities (correlations with the pure factors)
of about +0.8.

Meanwhile, U. I. (T) 24 can also be measured through the questionnaire med- 
ium, and an Anxiety Scale“? has therefore been constructed from the forty best 
loaded items, combining the Q-factors Q;(—), Qs, O, C(-) and L with due regard to the 
weights required for the most accurate estimate of the second-order factor. This 
Anxiety Scale is further divided into a score from ‘overt’, manifest questions, plainly 
dealing with anxiety symptoms, and a score from ‘cryptic’ questions which load as 
well on the general anxiety factor as the others, but do not seem to the patient so 
obviously to deal with symptomatology. However, the Anxiety Scale, requiring less 
than ten minutes to administer, is intended primarily to yield a single anxiety score. 
If good reliabilities are required for part scores, as a diagnosis of anxiety sources, ac- 
cording to the above theories, it would be desirable, at least in research, to supple- 
ment it with the 16 P. F. scores of the patient on the full test length for the primaries 
Q3(-), Q4, etc. This also would have the advantage, if the above hypotheses can be
considered reasonably verified by the present data, that the particular scores on the 
factor measures, in the individual case, will throw light on the etiology of his anxiety 
state, on the chief dynamic conflict sources of his anxiety, and the manner in which 
he is channeling its discharge. 
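
As a hedged illustration of how a second-order estimate of this kind might be formed from primary scores, the sketch below combines standardized scores on the five primaries with signed weights. The equal weights, the function name `anxiety_estimate`, and the labels Q3 and Q4 (our reading of the two Q-factors named above) are assumptions made for illustration only; they are not the published scoring weights of the Anxiety Scale.

```python
# Illustrative only: the equal weights below are an assumption, not published values.
# The sign gives the direction in which each primary enters the anxiety factor.
SIGNED_WEIGHTS = {
    "Q3": -0.2,
    "Q4": +0.2,
    "O": +0.2,
    "C": -0.2,
    "L": +0.2,
}


def anxiety_estimate(z_scores, signed_weights=SIGNED_WEIGHTS):
    """Return a weighted sum of standardized primary-factor scores."""
    missing = set(signed_weights) - set(z_scores)
    if missing:
        raise KeyError(f"missing primary scores: {sorted(missing)}")
    return sum(w * z_scores[name] for name, w in signed_weights.items())


# Hypothetical patient, one standard deviation toward the anxious pole on each primary.
patient = {"Q3": -1.0, "Q4": 1.0, "O": 1.0, "C": -1.0, "L": 1.0}
estimate = anxiety_estimate(patient)
```

With weights of this form, the individual products (weight times score) can also be inspected one by one, which corresponds to reading the primaries separately for the sources of a given patient's anxiety.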

Certainly the provision of distinct, factor-defined and reproducible tests, for 
anxiety and general neuroticism (the former either in a single objective battery or in 
a brief questionnaire permitting breakdown into primaries) should make possible a
wide array of meaningful experiments into which the clinician could not previously
confidently enter. The existing evidence suggests that these batteries are
already sensitive enough not only to give significant findings for inference from in- 
dividual differences, but also, like a clinical thermometer, to register significant 
changes in one individual during the course of therapy. From a study of curves of 
anxiety production and recession, at various critical stages of therapy, far greater 
insight and control of the process than now exist might become common in the art 
of therapy. And, from the standpoint of research into general laws, these instruments 
should permit clear-cut answers in response to clear-cut hypotheses. 

CRITIQUE OF RECENT DEVELOPMENTS IN PERSONALITY
COUNSELING THEORY 


FREDERICK C. THORNE
Brandon, Vermont 


INTRODUCTION 


The progress of science would be expedited greatly if the originators of significant 
contributions would make periodic progress reports concerning their latest thinking 
about recent developments related to theoretical systems. Much speculative con- 
troversy could have been avoided if Freud, Adler, John B. Watson and other theoriz- 
ers had issued periodic statements bringing their latest viewpoints up-to-date public- 
ly, and particularly where important revisions of earlier viewpoints had occurred. 
These comments are particularly true of the field of psychotherapy where there is 
great need for systematically relating and validating a large number of theoretical 
contributions which too often have not been assimilated into the body of scientific- 
ally established facts but have become the foundations of isolated schools or systems. 
The purpose of this article is to present a critique of personality counseling theory 
bringing up-to-date our most recent thinking on the subject. 


GENERAL REMARKS CONCERNING THEORIZING AND SYSTEM-BUILDING 


No one would voluntarily restrict creative theorizing which may supply hy- 
pothetical models for lucrative research design. There are definite dangers, however, 
in applied fields such as psychotherapy when theorizing and system-building far 
outstrip anything which has been established scientifically so that unvalidated 
theories are given large scale clinical application. It must be emphasized that in 
psychodiagnosis and psychotherapy, unproven theories and unvalidated practices
are in current use far more widely than in any other clinical field. Unlike modern
medicine in which schools and cults are practically extinct and where the subject 
matters consist largely of what has been established scientifically, in the field of 
psychotherapy we find a large number of competing theoretical systems and schools 
all being widely applied in the almost complete absence of validation and standard- 
ization. 

Theory-centeredness may lead to metaphysical and dialectic confusion which 
actually impedes the forward progress of science. One example of such tendencies 
exists in the current contents of many psychoanalytic and psychiatric journals of 
which a formal analysis of topics and methods reveals that 80-90% of the articles 
are purely speculative or anecdotal in contrast with a content of about 80 or 90% of 
experimental-statistical studies in basic-science oriented psychological journals. 
Unending speculative controversy goes on concerning the theoretical implications of 
even chance remarks made by Freud and other “authorities”. The situation is com- 
parable to the centuries-long speculations of the greatest minds of ancient times con- 
cerning the nature of the human circulatory system which was settled forever by 
Harvey’s simple empiric observation of the direction of blood flow in the veins by 
compressing and then releasing pressure on a vein on the back of the hand. The 
attempt to validate theories logically by constructing supposedly comprehensive and 
consistent systems is obsolete in an era when experimental-statistical validation is 
possible. It is regrettable that so much time is being currently spent in the attempt 
to secure respectability for unvalidated clinical methods by relating them to theories
which have some systematic prestige. 


PSYCHOANALYTICALLY-ORIENTED PSYCHOTHERAPIES


Psychoanalytically-oriented practices probably have the largest current following
and prestige in the fields of counseling and psychotherapy, particularly in educational
institutions and clinics which are administered by psychoanalysts. Since 1945,
the psychiatry departments of several major universities have been taken over by
psychoanalysts and we have reports from such installations or cities that personnel 
without psychoanalytic training find it difficult to obtain or retain appointments. 

There is little evidence that basic science psychology is either well understood 
or utilized in many psychoanalytic centers. Indeed, a large portion of the time of 
many analysts is devoted to the training of other analysts in the same unvalidated 
methods, which condition may continue to exist as long as it is necessary to obtain 
such qualifications to gain local professional recognition. 

This phenomenon of the contemporary prestige of psychoanalysis exists in 
spite of the fact that today, more than 50 years after the appearance of Freud’s basic 
theories, there is still no adequate experimental-statistical validation of the system 
either in whole or part. While psychoanalytic formulations may receive empiric 
validation in individual cases, there is little acceptable evidence that they may be 
universally applied to all persons at all times. Without in any way disparaging the 
brilliance of Freud’s contributions or minimizing their importance as clarifying cer- 
tain classes of behavior data, there is still room for sound skepticism concerning 
current practices of projecting psychoanalytic interpretations on all case materials. 
In personal observations of psychoanalytic approaches (as well as many others), we 
have frequently noted therapists displaying considerable intellectual ingenuity trying 
to match standard theoretical interpretations to samples of behavior data but with- 
out much apparent awareness of the evident invalidity of such practices. This con- 
stitutes an example of extreme directiveness in which the therapist projects diag- 
noses and interpretations uncritically whether they fit or not. Our conclusion con- 
cerning psychoanalytic approaches must be that this method is essentially intuitive, 
anecdotal, empiric and unvalidated in its present stage of evolution since there is a 
dearth of experimental-statistical data. Individual applications, no matter how 
brilliant and productive, are at present empiric and intuitive. A great deal of the 
difficulty will be obviated, undoubtedly, as the phenomena uncovered by the psycho- 
analytic method are integrated and interpreted in terms of basic science psychology. 
Expediently, however, we will all go on making psychoanalytic interpretations until 
some better methods become available, but dangers can be minimized if greater 
criticality and caution are used. 

Similar comments apply to projective techniques based largely on Freudian 
interpretations. Many clinicians go to extremes in applying projective interpreta- 
tions to all behaviors ranging from global integrations down to the most incidental 
mannerisms. Such methods would not be too objectionable if diagnoses were properly 
qualified with limiting statements indicating their hypothetical and essentially un- 
validated nature, but they become grossly unscientific and bordering on charlatanism 
when stated with absolute positivity which rules out the consideration of other 
possibilities. It would be wise not to forget the lessons taught by the fate of the early 
methods of Freudian dream interpretation. While dreams may have symbolic sig- 
nificance, their latent meanings only can be understood in terms of the dynamics of 
the individual case to which a universal dream vocabulary or symbolism can rarely
be applied. It is unwise to apply projective interpretations unless there is definite 
evidence that projective mechanisms indeed are operating. 


CLIENT-CENTERED COUNSELING 


Nondirective, client-centered methods of personality counseling have reached a 
stage of empiric validation where they are now widely accepted as a valuable part of 
our personality counseling armamentarium within the limits of their indications and 
contraindications. Utilized with clients who are motivated to understand and change 
themselves, and who have sufficient personality resources to work through to their 
own solutions once emotional blocks have been resolved, nondirective methods per- 
mit the client to reach his own solutions with maximum freedom. They are also 
valuable in many other clinical situations where acceptance, permissiveness and 
reflection of feelings facilitate the building of rapport, lead to more complete self-expression,
and encourage the development of initiative and responsibility in solving
one’s own problems. When such methods are utilized within their indications and 
contraindications, along with such other more active methods as may be indicated, 
and without making excessive claims for their achievements unsupported by actual 
research results, there can be no quarrel with their use. 

Although it is too early to make a definitive statement, evidence appears to be 
accumulating that use of the nondirective approach is indicated more positively with 
“ego-negative” persons who are dissatisfied with themselves and anxious to change,
and may be contraindicated with “ego-positive” persons who are satisfied with
themselves and have no insight into the fact of maladjustment or the desirability of 
changing. Negative self-regarding attitudes, at least when not on an obsessive or 
psychotic basis, tend to motivate the client to face problems and work out solutions, 
and a minimum of therapist activity is indicated, at least until such time as direct 
intervention is proven more efficient. In our experience, the nondirective approach 
tends to fail or reach an impasse with self-satisfied clients who either see nothing 
wrong or who do not want to change. This includes the large group of psychoneurotic 
persons whose unconscious defenses may resist even massive intervention. 

Much of the dissatisfaction and published criticism of nondirectivism appears
to have been stimulated more by the way in which such methods have been presented
or advocated than by the techniques themselves. If nondirectivism had
been advanced as simply a promising group of new methods to be integrated with 
older methods according to their indications and contraindications, it probably 
would have been widely accepted without controversy. However, the active pro- 
motion of the client-centered principle as the basis for a new and inclusive school of 
theory and practice, often advocated somewhat uncritically as a new panacea suit- 
able for all clinical situations, and even as a philosophy for the conduct of all human 
affairs, has stimulated critical reactions which have sought to cut the new movement 
down to size and integrate its contributions into existing knowledge. The efforts of 
nondirectivism to erect a theoretical rationale in support of its empiric achievements 
often have led to the making of claims which did not seem validated by actual re- 
search findings. Finally, the tendency of some nondirectivists to refer to themselves 
and act as if they were the exclusive proprietors of a new school whose methods 
would soon make obsolete all older techniques has quite understandably stimulated 
criticism from those who believe that all new findings must be integrated into basic 
science knowledge and methods. 

More than ten years ago, this writer published a critique of nondirective
methods of counseling and psychotherapy motivated by our concern over the ap- 
parent failure of this new school to incorporate certain standard principles of ob- 
jective methodology in research and clinical approaches. A restatement of our basic 
criticisms in 1957 would point up five major defects of research design which serious- 
ly vitiate the conclusions which can be drawn from most of the research to date. 


(a) The failure of nondirectivists to recognize the importance of valid diagnosis in all
scientific case study, and particularly in research design, makes it impossible to evaluate clinical
claims and research results.


(b) The failure to attempt to identify all pertinent variables diagnostically has made it impossible
to utilize standard research designs involving use of equated samples, rigorous identification
and manipulation of dependent and independent variables, and the objectification of
all factors and steps known to be involved.


(c) The failure to control, or otherwise partial out, the effects of other therapeutic factors (such 
as suggestion, catharsis, desensitization, etc.), which are known to be operating, makes it impossible
to draw any valid conclusions as to what is actually occurring in nondirective therapy.


(d) The failure to check conclusions drawn from phenomenological data obtained from the 
client (or inferred by the therapist) with actual objective measurements of known factors or 
status may result in serious misinterpretations of the realities of the therapeutic situation. 


(e) Even if positive therapeutic results are demonstrable using nondirective methods, it still re- 
mains to be proven that such effects could not have been achieved just as well or even more 
expeditiously with other methods. 

In conclusion, then, the failure to rigorously identify, diagnose and control all
factors known to be operating in psychotherapy has seriously invalidated much of 
recent research on nondirective methods. We are not told whether the subjects are 
normal or abnormal, the nature and pathogenicity in terms of actual case history 
findings of the factors being therapized, and we are not convinced that the postu- 
lated factors actually explain the results obtained. In other words, most of the 
factors which we need to know about have been left undifferentiated and uncon- 
trolled. In effect, it appears that nondirectivism postulates one standard diagnosis 
for all clients, namely that emotional factors blocking growth are the most important 
cause of all disorder, hence diagnosis is not necessary and only one standard method 
(nondirectivism) is indicated. 

The most prominent success of client-centered methodological research has been 
in objectively demonstrating the actual operations which take place in this method. 
Nondirectivism has made a very great contribution in developing passive methods 
of counseling emphasizing acceptance, permissiveness, reflection and clarification of 
feelings, and placing the responsibility upon the client for working out his own 
problem solutions. Conversely, the dangers of crude directiveness, counselor-center- 
edness, and the invalid projection of interpretations upon the client’s behavior are all 
better understood as a result of insistence on the client-centered principle. The issue 
still remains, however, concerning indications and contraindications for use of non- 
directive methods with different types of clients, pathological processes, degrees of 
pathogenicity and other factors determining outcomes. 


Proponents of nondirectivism have been less successful in trying to construct a 
systematic theoretical rationale for their viewpoint. It appears that Rogers’ earliest 
contributions were developed by intuitive-empiric methods, interpreted at first
in terms of somewhat mystical concepts of self-realization and growth, and more recently
developed in terms of a thoroughgoing phenomenological approach. While
such preoccupation with the self-concept and other phenomenological data as ex- 
perienced by the subject himself and understood empathically by the therapist is 
very intriguing and commendable in itself, considerable criticism may be directed 
toward the attempt to construct a complete system on such a limited approach. 
Even more caution should be invoked in evaluating claims by nondirectivists for 
unique processes and results from their methods, particularly when it is known that 
all therapeutic methods have their successes. Before we can ascribe much validity 
to claims made, for example, from changes on Q sorts given pre- and post-client- 
centered therapy, it will be necessary to obtain comparable Q sorts from clients 
treated with Coueism, Christian Science, Dianetics, psychoanalysis and many types
of directive methods (all of which have very enthusiastic clients and practitioners).
It may be that certain standard factors underlie all these methods and that specific
philosophical rationales are superfluous. Snyder recognizes this dilemma when he
states that “Perhaps it doesn’t make very much difference which philosophies guide
a therapist’s work”. While it is interesting to attempt to trace the philosophical
antecedents of recent theoretical developments as when Walker perceives Freud
as the intellectual disciple of Augustine, and Rogers the intellectual descendant of
Jesus, Confucius or Rousseau, there are dangers in implying that psychoanalysis or 
nondirectivism have some face validity from being related to such respected author- 
ities. Similar comments apply to the currently popular practice of using laudatory 
or discriminatory designations to categorize accepted or rejected concepts as when 
Snyder categorizes neofundamentalism, scholasticism, behaviorism, psychobiology,
learning theory and directive counseling as being “authoritarian”, while
nondirectivism is identified with “self-actualism” or “self-determinism”, idealism,
democratic government, humanism, progressive education, hormic psychology and 
phenomenology. Such a dichotomy is purely arbitrary and probably artifactual, and 
even worse are the self-reflexive connotations of the term “authoritarian” as interpreted
against the recent associative contexts of Fascism, Communism, regimentation
and exploitation as contrasted with other possible connotations involving such
positive contexts as love, protection, paternalism, guidance, help and wise counseling.
The use of such self-reflexive terms in purportedly scientific communications is
very undesirable. 

While we would defend to the end the right of Carl R. Rogers to expound his
own philosophical queries and experiences in relation to problems of science and 
phenomenology, we must insist that the evidential value of such observations must 
be rigorously evaluated by scientific methods before being accepted as anything but 
expressions of personal opinion and faith. The quandaries of Rogers in relation to 
“Persons or Science” appear to be rationalizations of the deficiencies of his ideological
approach. Rogers’ rejection of diagnosis as a basic method of research and of
clinical practice has seriously limited the scope of his whole approach, and his failure 
to modify his position, to publicly indicate the limitations and contraindications of 
nondirectivism, or to discuss their integration with older methods, may actually 
inhibit the acceptance of the whole movement. Snyder appears to admit this in his 
somewhat despairing comment of a “pox on both sides” as he concludes that “as
long as Rogers cares and the Freudians care, both seem to be meeting the needs of
clients.”

Actually, the defects of the research design under which most nondirective 
research has been done could be avoided by the simple procedure of having some 
independent authority select and thoroughly study samples of cases before submit- 
ting them for nondirective treatment. This device would simultaneously objectify 
the nature of the case materials while at the same time free the client-centered ther- 
apist from introducing any sort of diagnostic procedure during his case handling. 
After the termination of therapy, the independent authority could again study the 
cases for the presence and nature of any changes occurring from therapy. Such results
could then be compared with other samples treated by other standard methods,
and the data subjected to factor analysis to discover any common factors underlying
all methods.
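
The comparison proposed above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not a complete research design: the independent authority is assumed to give each case a single numerical rating before and after therapy, the ratings shown are invented placeholders, and the names `mean_change` and `compare_methods` are hypothetical. The factor analysis step is not attempted here.

```python
from statistics import mean


def mean_change(cases):
    """Average post-minus-pre change over (pre, post) independent ratings."""
    return mean(post - pre for pre, post in cases)


def compare_methods(samples):
    """Map each treatment method to the mean change observed in its sample."""
    return {method: mean_change(cases) for method, cases in samples.items()}


# Invented placeholder ratings (lower = less maladjustment); not real data.
samples = {
    "nondirective": [(10, 7), (12, 9), (8, 8)],
    "other_method": [(11, 9), (9, 6), (10, 10)],
}
changes = compare_methods(samples)
```

Because the same rater scores every sample before and after treatment, differences between the entries of `changes` reflect the samples and methods rather than each school's own criteria of success; whether such differences are reliable would still require equated samples and a significance test.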


ECLECTIC COUNSELING


The eclectic method of selecting appropriate theories and methods to fit the 
indications and contraindications of each individual case has long been the standard 
procedure in the older clinical sciences such as medicine. In personal reaction to the 
seeming one-sidedness of nondirective and psychoanalytic approaches, publication 
of a series of articles on directive therapy was begun in 1945 and later expanded into 
a systematic textbook. Our adoption of the term directive was intended only to
emphasize the existence of a large number of therapeutic methods (including non- 
directivism) within the larger framework of all that was known about influencing 
personality. Our use of this term was not intended to imply any preference for any 
special group of methods or philosophical viewpoint, and we regret that many read- 
ers immediately projected many unintended meanings which now we do not wish to 
be identified with. The term eclectic would have been more accurate semantically 
and, by 1948, we specifically attempted to limit the term directive to the systematic 
attempt to use all methods of therapy eclectically based on a formal plan for the 
identification and modification of etiologic factors in maladjustment. Now, we
prefer to abandon all terms such as directive, nondirective, psychoanalytic, etc., as 
tending to create artificial continua and semantic artifacts, and instead to substitute 
operational descriptions and nomenclature for the dynamics of any method. The 
term eclectic has value only to differentiate an approach which refuses to subscribe 
to any one operational viewpoint or method, and becomes entirely objectionable if 
given self-reflexive connotations such as “authoritarian”.

The eclectic approach to scientific case handling basically depends upon ade- 
quate diagnosis as the cornerstone of all case handling in order to identify etiologic 
causes and apply appropriate therapeutic methods according to their indications and 
contraindications. Currently, the most important theoretical issue in personality 
counseling and, indeed, all psychotherapy, is the matter of the relation of diagnosis
to valid therapy. Far from attempting to establish any new school or system based
on “directive” methods, our sole purpose has been to emphasize the importance of
diagnosis as the cornerstone underlying the rationale for knowing the indications 
and contraindications of all known methods utilized without systematic bias of any 
kind. Frankly, we have been very much disturbed over the past few years by the 
discovery that many teachers and clinicians have entirely misunderstood our fundamental
theoretical position which stresses a thorough-going basic-science-oriented
eclecticism by erroneously identifying us with directivism as a method or end in 
itself. Our original selection of the term “directive psychotherapy” was intended 
only to emphasize that an important group of methods exists in contrast to non- 
directivism, and it was not intended to identify a new school or system. Now the 
need for emphasizing such a difference has long since passed, and we would prefer to 
use the term directive only to indicate one end of a continuum of passivity vs. activity 
in the therapist. In our own practice, we try to be passive or active according to the 
indications of the situation rather than following any ideological bias. 

One of the criticisms advanced, particularly by nondirectivists, against eclecti- 
cism in its present status of development, relates to the voluminous research which 
has been accumulated by special schools in contrast with the meagre or nonexistent 
research output achieved by the exponents of eclecticism. Such criticisms have been 
directed against eclectic or directive counseling as where Shoben says that “Thorne
is unable to escape an authoritarian tone that could not be challenged in itself if it
were buttressed either by experimental evidence or convincing case material” (p. 4),
a comment which has appeared repeatedly in many forms. Such a criticism appears
to reflect a lack of understanding concerning the differing problems and responsibilities
placed upon those who accept eclecticism in contrast with those who
support more limited special methods. Our original aim in attempting to compile a 
unified systematic approach to diagnosis and therapy was to attempt to gather in one 
place all that was scientifically valid and fairly well accepted from all sources or 
schools, both clinical and experimental-statistical, eclectically. We have never made 
any claims for any of these methods beyond what had been advanced in the literature 
or appeared valid in terms of personal clinical experience. We submit that this was 
all that could be accomplished at time and place, and we make no special claims for 
any superiority of any of these methods beyond what has been empirically estab- 
lished. Our role was simply to attempt an integration of all that seemed to be accept- 
ed in terms of the accumulated clinical experience of our generation and without 
favor or prejudice for any method beyond the known indications and contraindica- 
tions as currently understood. This task of attempting to integrate, in one internally- 
consistent presentation, the accumulated experience of clinical science is in itself so 
gigantic as to preclude any personal research attack upon the vast number of prob- 
lems and methods, and certainly beyond our personal resources which have been 
invested maximally in the attempt to integrate existing research interpretively. We 
can only state, then, that our conclusions represent only the concentrate of human 
knowledge at time and place as we have interpreted it. We feel that there is a definite 
need not only for researchers who can provide valid data but also for global inter- 
preters who must attempt to relate the many disparate findings which are in them- 
selves more or less meaningless until oriented in some master frame of reference. 

Any deficiencies of eclecticism or directivism simply reflect either (a) the rela- 
tively primitive state of human knowledge concerning these methods, or (b) the 
inadequacies of breadth of viewpoint among those who attempt an integrative inter- 
pretation. To the degree which eclecticism is able to integrate all operational meth- 
ods and findings available at time and place, it appears to us that it must represent 
the last word concerning what we can validly understand and apply in practice. We
may accept, as a basic premise, the axiom that clinical methods are simply individual 
applications (within the limits of validity of clinical judgment) of basic science ex- 
perimental-statistical methods and data. We seek to protect the validity of our judg- 
ments and conclusions by rigorously seeking to discriminate cause-effect relationships 
by the most advanced diagnostic methods. We submit that this is all which can be
done for time and place within any clinical orientation. Eclectic methodology is so 
diverse and complex that a general advance on all fronts simultaneously can come 
only very gradually, and it is certainly beyond the scope of any one person or generation
to advance them very markedly. In spite of this admitted deficiency based on the 
primitive status of existing knowledge, we can be assured of some degree of face, 
content, empirical and factorial validity by constantly seeking to relate our practices 
to the most advanced basic science thinking of time and place. 

In contrast, the proponents of special methods have a particular obligation to 
demonstrate that new ingredients have been added over and above existing knowl- 
edge. Considering the impressive basic science advances of the last century, both by 
science in general and by special schools, the burden of the proof rests upon proponents
of new schools to show where and how they have advanced human knowledge
beyond merely introducing new terms or rehashing the known. Modern medicine,
for example, has reached a status of solid basic science achievement where we
can expect new discoveries in special areas by experimental-statistical methods, but 
we do not anticipate any revolutionary philosophical advances which will change 
our basic methods of scientific discovery or thinking. Science supersedes schools of
philosophy or metaphysics as soon as it accumulates sufficient objective knowledge 
in any field to provide valid bases for clinical applications. We believe that clinical 
psychological science has already achieved such a solid basic science foundation of 
proven facts as to make obsolete the need for metaphysical theorizing and special 
schools. If new advances are made, they must be integrated into the broad founda- 
tion of scientific knowledge rather than being offered as revolutionary panaceas to 
replace all that is eclectically known. 

Eclecticism in clinical practice is always held back in situations where students 
are trained in only one method, and are either oblivious of other methods or insecure 
in their application. Until such time as students are provided training experiences 
with all known operational methods of diagnosis and therapy, it is evident that their
capacity or competence in applying all known methods eclectically will be limited.
The need to build up one’s confidence clinically by becoming a “master” in some
specific school, such as nondirectivism or psychoanalysis, disappears as personal 
competence with all known methods broadens and one is sustained by the knowledge 
that one knows all there is to know for time and place, i.e., broad scientific knowledge 
and clinical competence reduce the need for speculative theorizing. 


DISCIPLINARY COUNSELING


Williamson and Foley (7) have presented a detailed reporting of the use of
“counseling” in the disciplinary management of student conduct problems as de- 
veloped since 1941 in the office of the dean of students at the University of Minne- 
sota. Although the authors evidently are aware of the classically therapeutic basic 
role of personality counseling, and repeatedly emphasize the rehabilitational and 
reeducative objectives of their program, it becomes immediately apparent that the 
term “management” would be a much more realistic designation of their methods of 
case handling than is the term “counseling”. Although in defining their concept of
the role of counseling in student discipline, the authors are careful to structure their 
activities as not being authoritarian, regulatory or punitive, an analysis of their out- 
line of case handling methods and the actual transcripts of individual cases indicates 
that their activities are conducted within a plainly authoritarian judgmental frame- 
work which can hardly be perceived as anything else by the students. In spite of 
what Williamson and Foley theorize the situation to involve, the critical element in 
categorizing the nature of the situation must be in terms of what the subjects of the 
counseling perceive it to be. In discussing their procedures of disciplinary counsel- 
ing, their schematic outline“: »- ® includes such steps as (a) making of charges 
against the student, (b) compulsory interviewing for counseling purposes, (c) re- 
porting and review by a disciplinary committee in a face-to-face situation with the
student, and (d) disciplinary action by the committee involving the imposition of
further regulatory actions upon the student. Whatever the professed intentions of 
the disciplinary committee, the execution of its functions in such a framework cannot 
avoid the connotation of a judgmental, regulatory and, perhaps, punitive orientation 
to case handling as the whole situation is perceived by the student from the implica- 
tions of what goes on. That many students may, indeed, perceive the situation as 
basically judgmental and punitive is clearly evident in the case of Milton Reed 
(7, p. 228) where not only the accused but also two friends openly profess their fears 
that unreasonable judgments and/or punishments are going to be administered.

In our opinion, no method of case handling deserves to be designated as counsel- 
ing unless healing is the basic and primary objective beyond which all other consid- 
erations are secondary. In order to retain its basically therapeutic functions, the 
counseling situation must be thoroughly safeguarded to preserve its purpose of being 
the one place (in addition to psychiatry) where a person may frankly discuss his 
problems in a completely healing atmosphere unthreatened by any question of 
judgment or punishment. We are in agreement with Rogers“? in his contention that 
any other structuring of the situation destroys the therapeutic context. 

Implicit in the approach of Williamson and Foley is the questionable assumption 
that misbehavior is classifiable into voluntary and involuntary types for which dis- 
cipline and therapy are the solutions respectively. While the authors recognize that 
immaturity and personality deviations may interact with unhealthy environmental 
forces to produce misconduct, the installation of such complicated judicial machinery 
and practices only succeeds in advancing the implication that we are really dealing 
with deliberate misbehavior, bordering on criminality, and which can be handled 
best by punitive procedures progressively applied until the miscreant sees the light 
and voluntarily controls himself better. We must insist that such methods of case 
handling are more properly designated as management and do not qualify as counsel- 
ing because the basic operational framework is not primarily therapeutic but rather 
is regulatory and administrative. This is not to imply that counseling cannot be 
carried on within disciplinary or administrative auspices where the healing motive is 
primary and disposition is secondary. We would prefer to base all counseling on the 
postulate that all misbehavior reflects immaturity, personality deviation or personality 
reactions to situations, reflecting various degrees of lack of control, and arising etiologi- 
cally in the domain of psychopathology. From such a viewpoint, all disciplinary prob- 
lems should be handled within a completely therapeutic frame of reference, only be- 
ing referred for regulatory action in the event of the breakdown of therapeutic pro- 
gress and poor prognosis. The authorities could take notice of misconduct by anony-
mous comments on the situation, clearly indicating their position that such behavior
must be the result of unhealthy thinking, and further indicating that gross regulatory 
methods would be enforced only in the event of the breakdown of rehabilitation. 

There is serious question whether disciplinary counseling as practiced in the 
case of Janet Larson’: ». 26) may be actually untherapeutic and dangerous to the 
health of the subjects concerned. We can think of few situations more traumatic 
and guilt-productive than for students to be detected in the act of sex intercourse in 
the bushes by the university watchman and such detection is usually punishment 
enough. Psychiatrically, there is great danger of the girl (at least) developing in- 
capacitating mortification and guilt reactions which may lead to life-long neuroticism 
and frigidity. When this student was interrogated by F. L. Crane in the presence of 
John D. Foley on July 22, 1945 in great detail concerning what took place, the cir- 
cumstances of the interview indicate that it was conducted somewhat like a third 
degree solely for the purpose of securing a signed confession and with nothing thera- 
peutic taking place. Further, on July 23, the accused Janet Larson and William 
Burton were called before the faculty disciplinary committee and interrogated to- 
gether about their marriage intentions, guilt reactions and penitent attitudes. The 
trend of the questioning seemed to hint that the university would be satisfied if the 
couple married (shot gun influence?) but it actually did not accomplish this purpose,
perhaps because Burton was led to admit previous intercourse before Miss Larson
and also did not seem completely enthusiastic about marriage. The most therapeutic 
action taken was the decision of the disciplinary committee to put the students on 
probation while continuing at the university. It is interesting to note that on May 
11, 1946, counselor G. E. K. spontaneously notes that Miss Larson no longer has “a
vivacious, energetic enthusiasm” in her manner but has changed to “quiet winsome-
ness”, which might be interpreted as reflecting deep anxiety or guilt. In any case, a
perusal of the case record reveals no evidence of therapeutic counseling but only the 
administrative details of handling student misbehavior. 

An analysis of the entire 15 cases cited as examples of disciplinary counseling 
reveals that the counselor activity seems to consist in acting as the representative of 
university authority in identifying alleged misbehavior, identifying the students 
involved, obtaining signed confessions before witnesses, conducting judicial hearings, 
and later advising the student how to behave in order to stay out of trouble again. 
While some secondary therapeutic results may be achieved where leniency, tolerance 
and patience may operate to keep the student in the university until he can learn to 
handle objective behaviors more adaptively, there is little evidence either that the 
primary emphasis is therapeutic or that modern counseling methods have been used 
under maximally effective conditions to secure deep personality changes. 


DIFFERENT OPERATIONAL APPROACHES 


Order may be brought into the current confusion of competing psychotherapies 
by reducing or translating each approach to its lowest operational common denom- 
inator. As pointed out elsewhere “*: ». 8), behavior analysis may be attempted 
from the operational viewpoints of instinct theory, perceptual field theory, learning 
theory, motivation theory or the psychology of the emotions. Each diverse approach 
utilizes its own operational levels of personality study and achieves its own dis- 
tinctive results which may reflect determination by general, group or specific factors. 
Clinicians trained primarily in one operational approach tend to have trouble com- 
prehending the methods and results from other operational approaches. The ad- 
vantage of a thoroughgoing operational approach is that it permits the study of raw 
behavior data directly and with the formulation of diagnostic impressions inductively 
to fit the individual case with the progressive interpretation of data. The problem is 
to construct a theory to fit the case inductively rather than trying to apply theories 
deductively. Operationism renders unnecessary undue speculation and theorizing if 
carried out in sufficient detail to uncover the major etiologic factors in each case. 

The fact that many methods appear to produce results can only have two poss- 
ible explanations. Either similar dynamic factors are operating in diverse methods, 
or there is more than one method for achieving the same specific goals. For example, 
as pointed out by Snyder“: »- *), perhaps a theory of psychotherapy does little 
more than to inspire the therapist with confidence which is transmitted to the client 
and becomes the basis of a relationship which shows the client that “somebody” 
cares. Or perhaps suggestion effects and emotional release are enough to help most 
clients. Or perhaps merely doing something until time and nature heal is enough. 
Such important questions can only be solved with a research design complex enough 
to identify and control all relevant variables. In the meantime, symposia on per- 
ceptual field theory or learning theory in relation to psychotherapy may be very 
valuable in casting light on what these various operational approaches may contrib- 
ute but it should not be expected that any one operational approach can provide the 
complete answer. Combs®? may be correct in concluding that traditional learning 
theories offer little towards the improvement of counseling (that is, beyond the gen- 
eral recognition that all therapy involves learning according to well-known psycholo- 
gical laws), but we feel that he is overoptimistic in assuming that all behavior is a 
function of the field of perception at the moment of behaving so that the process of 
counseling becomes one of altering perceptions in desired directions. Shaw“ ap- 
parently takes a much more valid position (in agreement with our eclectic view-
point) that varied conditions of learning in persons may produce a variety of etiolo-
gic factors operating on many levels of personality integration. We agree with 
Shoben“? that contemporary learning theories are most interested in principles of 
learning while the clinician must attempt, as we have also previously pointed out, to 
(a) establish suitable conditions so that learning can take place, and then (b) to 
actually introduce new conditionings in more healthy directions. “* »- 278) Our own 
detailed viewpoint is that personality integration may be organized by many levels 
of factors, which may be perceptual, learned, motivational, affective or instinctive, 
operating at various levels of generality or specificity, and each requiring their own 
particular operational methods for diagnosis or treatment. “® °-'*") It will probably 
require complex experiments of factor analytic design to differentiate prototypical 
patterns of syndrome organization. 


SEMANTIC PROBLEMS 


A most urgent problem in the whole field of diagnosis and therapy concerns the 
early adoption of a uniform and universal system of nomenclature so that the co- 
ordinates and boundaries of psychological space may become standardized. Every 
few months some authority launches a new theory replete with its own distinctive 
terminology which usually is not related to the coordinates of accepted knowledge. 
Two recent examples of such system-building with difficult new terminologies are 
Allport’s event-structure theory”) and J. G. Miller’s general behavior systems 
theory“, both of which will require long intensive study to become proficient with 
and simply to master the terms with any facility. And even after the new terminology 
is mastered, there still remains the problem of relating the new contributions to 
existing knowledge to discover whether or not the new formulation makes any dis- 
tinctive contribution. There is great need for the profession to adopt a standard 
nomenclature with a universal index of factors to be measured. The most promising 
beginnings in this direction have been made by the factor analysts who are working 
towards standard methods for labelling and mapping new factors in their space-time 
dimensions, and we have personally adopted standard terminology in our approach 
to psychological examining. Semantic chaos will result from the multiplication of 
new theories and systems unless standard nomenclature is uniformly adopted to 
relate new developments to existing coordinates. In our own editorial work, we uni- 
formly reject papers introducing new and esoteric terminology unless it can be clearly 
shown that such refinements are needed to describe genuinely new developments. As 
matters now stand, students entering the field must spend entirely unreasonable 
amounts of time trying to master the babel of conflicting terminologies. 


PREDICTIONS FOR THE FUTURE 


The most urgent theoretical question in personality counseling and, indeed, all 
psychotherapy, is the matter of the relation of diagnosis to valid therapy. Currently, 
it is fashionable in some circles to ignore problems of diagnosis or to categorize such 
interests as authoritarian. ©: »- *) It is a basic postulate of all scientific method to at- 
tempt to discover the nature and etiology of all variables determining a natural oc-
currence, i.e., to identify causes and effects. Without in any way attempting to de-
fend the inadequacies of current diagnostic methods and systems, we cannot see how
any progress can be made in understanding causation or rationally manipulating de- 
terminants unless a diagnosis is made of all antecedents leading to consequences. Our 
opinion is that advances in psychotherapy must depend on more valid diagnosis and 
to this end we have evolved a systematic method “* for evaluating all factors known 
to organize personality integration. We emphatically reject the hypothesis that 
psychotherapy is in any way different from other clinical science in the sense of it 
being possible to dispense with diagnosis in scientific case handling. This has nothing 
to do with any philosophic bias but is a tenet of scientific method. 

Within this general basic science orientation, we may predict future develop- 
ments in the issues which are now so controversial. We predict that the current
heavy investment of time and energy in speculative theorizing and system-building
will be supplanted by a consistent effort to relate all knowledge and new develop- 
ments in terms of a standard nomenclature and universal index of factors and results. 
We predict that exclusive preoccupation with special operational approaches such as 
perception or learning theory will be replaced by a broader eclectic utilization of all 
available operational approaches according to the indications and contraindications 
of time and place. We predict that nondirectivism (and all other special therapies) 
will be demoted from their current status as mutually exclusive systems to take their 
proper places as limited technical advances integrated into a broad diagnostic-thera- 
peutic armamentarium applied eclectically, without special bias or preference for 
specific tools. We predict that much of psychoanalytic speculation and dialectics will 
be abandoned, either because ultimately proven to be unvalidatable or more parsi- 
moniously explained in terms of basic science concepts, and leaving a residual of fact 
which can be integrated into basic science knowledge in the proper places. We pre- 
dict that all clinicians will accept ultimately the absolute essentiality of diagnostic 
thinking and will cease to seek short-cuts motivated by current deficiencies. In other 
words, we reaffirm our belief in the ultimate victory of scientific methodology in 
proving its capability of dealing with all psychological problems without recourse to 
less rigorous philosophizing and speculation. 

We cannot escape the fact that all natural phenomena reflect the operation of 
laws which determine causes and effects. If we are to understand and manipulate 
these causes and effects with maximum efficiency, then we must recognize the various 
categories of action imperatives involved in the concept of “ought”. Elsewhere “*?,
we have explored the implications of the psychology of “ought” for social planning in
general, and individual efficiency in particular, with special reference to what can be 
accomplished in diagnosis and therapy. We insist that all this can be accomplished 
within the orientation of eclecticism. We hope that adherents of various special 
positions will cease projecting various characterizations upon each other and will 
work in common to discover and control the actual etiologic factors which must be 
operating.


PROBLEM


Cronbach and Meehl®) have examined the construct of validity in psycho- 
logical testing and discussed four types of validation—namely, predictive validity, 
concurrent validity, content validity and construct validity. The first two kinds of 
validity can be distinguished from the last two in terms of criterion-oriented valida- 
tion procedures. The present paper is concerned with construct validation which “is
ordinarily studied when the tester has no definite criterion measure of the quality
with which he is concerned and must use indirect measures.” ®: ». 282) As Cronbach
and Meehl have indicated, in studies of construct validity the trait or quality under- 
lying the test is of central importance, rather than either the test behavior or the 
scores on the criteria. In their paper they mention ©: »- *°°) that many types of evi- 
dence are relevant to construct validity, including content validity, interitem correla- 
tions, intertest correlations, test “criterion” correlations, studies of stability over
time, and stability under experimental intervention. 

The present study is an attempt to evaluate certain aspects of the construct 
validity of the Sixteen Personality Factors Test principally by means of correlating 
its factors with the better-known scales of the MMPI. Specifically, the following 
questions were investigated: What are the relationships between MMPI scale scores 
and 16 P. F. factor scores? Do the two forms of the 16 P. F. Test correlate highly 
enough to be used as equivalent forms as they were intended, i.e., what are the
coefficients of equivalence of the individual factors? Further, what is the relation- 
ship between scores on factor B (general intelligence) and scores on individual tests 
of intelligence such as the Wechsler-Bellevue Intelligence Scale Form I (W-B I) and 
the Wechsler Adult Intelligence Scale (WAIS)? Finally, an attempt was made to 
determine whether experienced clinicians are capable of predicting which factors 
from the 16 P. F. would be significantly correlated with each of the MMPI scales. 
The design of this study should make apparent which areas of personality the 16 
P. F. Test and the MMPI measure in common as well as those areas which are 
idiosyncratic to each. 


PROCEDURE 


The sample consisted of 71 USAF officers on flying status who were referred to 
the School of Aviation Medicine for medical and psychological evaluation. The 
average age was 33, with a range of 20 to 39 years, and the mean number of years of 
education was 13, with a range of 12 to 16 years. The mean Full Scale IQ on the 
Wechsler Adult Intelligence Scale was 121.98 with a standard deviation of 6.50. 
The patients were referred for evaluation between August, 1955, and July, 1956, 
and were individually administered the group form of the MMPI which contains 
566 items by the Department of Neuropsychiatry, and Forms A and B of the 16 
P. F. Test by the Department of Clinical Psychology. Usually no more than 3 days 
elapsed between the administration of these two tests. The two forms of the 16 P. F. 
were alternated so that one half of the sample had Form A first, while the other half 
had Form B first. The 16 P. F. was given as a part of the routine diagnostic psycho-
logical testing battery. Both forms of the test were usually administered on the same
day with the form taken first given in the morning, and its counterpart in the after-
noon. The sample employed in this research is believed to be typical of the flying
personnel who are referred for consultation in the United States Air Force.


1The data for the study were collected and the statistics computed while both authors were at the
School of Aviation Medicine, USAF.

2Appreciation is expressed to Lts. John K. Sussenberger and Gordon Raynor for their aid in the
administration and scoring of the tests, and to Dr. S. B. Sells for suggesting the comparison between
the 16 P. F. Test and the MMPI.

The tests were scored according to the directions given in their respective man- 
uals: 4: 1), The raw scores for Forms A and B were converted into standard scores 
by using the sample of male college students and high school seniors whose average 
age was 21 as the standardization group. Pearson coefficients of correlation within 
and between the two tests were computed, using the raw scores of the 14 MMPI 
scales and the standard scores of the 16 P. F. Test. 
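
The scoring step described above can be illustrated with a minimal sketch. It assumes that a "standard score" is a simple z-score against the norm group's mean and standard deviation (the test manual may well use a different scale), and the score lists and norm values below are hypothetical, not data from the study.

```python
from math import sqrt
from statistics import mean


def standard_scores(raw, norm_mean, norm_sd):
    """Convert raw scores to z-scores against an external norm group.

    Assumption: a plain z-score; the manual's own standard-score scale may differ.
    """
    return [(x - norm_mean) / norm_sd for x in raw]


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical scores for six examinees (illustration only).
mmpi_raw = [12, 15, 9, 20, 17, 11]   # raw scores on one MMPI scale
pf_raw = [8, 11, 7, 14, 12, 9]       # raw scores on one 16 P. F. factor

# Hypothetical norm-group mean and standard deviation for the factor.
pf_std = standard_scores(pf_raw, norm_mean=10.0, norm_sd=3.0)

r_mixed = pearson_r(mmpi_raw, pf_std)   # raw MMPI vs. standardized factor
r_raw = pearson_r(mmpi_raw, pf_raw)     # raw MMPI vs. raw factor
```

Because a z-score is a linear transformation with positive slope, `r_mixed` and `r_raw` are equal. Under this assumption, correlating raw MMPI scores with standardized 16 P. F. scores gives the same coefficients as correlating raw scores on both tests.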


RESULTS 


In every case high scores on the MMPI were correlated with high scores on the 
16 P. F. Test. It should be noted that on some of the factors of the 16 P. F. a high 
score does not have a negative connotation as it always does on the MMPI. The 
matrix of intercorrelations is presented in Table 1. The large number of significant 
correlations obtained between the 16 P. F. factors and MMPI scales (87 out of 224
or 39%) demonstrates there is a common area of overlap between these tests of 
personality. As might be expected, fewer significant correlations occur among the 
16 P. F. factors (39 out of a possible 120 or 32%) than among the MMPI scales (61 
out of a possible 91 or 67%). These results are a function of the factor analytic con- 
struction of the 16 P. F. as compared with the strictly empirical method used in the 
construction of the MMPI. 
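
The denominators in this paragraph follow from the number of scales in each test, and the arithmetic can be checked with a short sketch. The only inputs are the 14 MMPI scales, the 16 factors, and the counts of significant correlations reported above; the variable names are illustrative.

```python
from math import comb

N_MMPI_SCALES = 14   # MMPI scales scored in the study
N_PF_FACTORS = 16    # factors of the 16 P. F. Test

# Every MMPI scale paired with every 16 P. F. factor.
cross_pairs = N_MMPI_SCALES * N_PF_FACTORS

# Distinct pairs within each test (unordered, no self-pairs).
within_pf = comb(N_PF_FACTORS, 2)
within_mmpi = comb(N_MMPI_SCALES, 2)


def percent(k, n):
    """Share of significant correlations, in percent."""
    return 100.0 * k / n


summary = {
    "between tests": (87, cross_pairs, percent(87, cross_pairs)),
    "within 16 P. F.": (39, within_pf, percent(39, within_pf)),
    "within MMPI": (61, within_mmpi, percent(61, within_mmpi)),
}

for label, (sig, total, pct) in summary.items():
    print(f"{label}: {sig} of {total} = {pct:.1f}%")
```

The between-test total is 14 × 16 = 224, the figure also given in the footnote to Table 4, and 87 of 224 is about 39%.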

Looking now at the significant relationships which the MMPI and 16 P. F. 
Test (Forms A and B combined) share with one another, it can be observed from 
Table 1 that apparently the latter test adequately taps all of the areas of personality 
measured by the MMPI with the exception of the Pa and ? scales. Each of the other 
12 MMPI scales is significantly correlated with a minimum of four factors from the 
16 P. F., even though the MMPI is the larger of the two tests by approximately 
30%. When the MMPI scales are matched with the factors from the 16 P. F. Test, 
7 of the 16 factors are correlated significantly with 3 or fewer scales of the MMPI— 
namely, factors A, B, E, G, N, Q1 and Q2. Apparently the dimensions of personality
measured by these factors are not well represented in the MMPI. These results ap- 
pear to demonstrate certain advantages in economy of test length, as well as in 
breadth, for the factor analytically constructed 16 P. F. Test over the empirically 
constructed MMPI. 

The direction of certain of the specific relationships observed in Table 1 is
notable. Curiously enough, L scores on the MMPI correlate positively with scores
on factor C (ego strength), and negatively with factors O (anxious insecurity) and
Q4 (nervous tension). Further, Hy correlates positively with factor N (sophistica-
tion) and Mf correlates positively with factor B (general intelligence), negatively
with factor F (surgency) and positively with factor Q2 (independent self-sufficiency),
while Ma correlates positively with factor L (suspecting, jealous). The other 79
significant correlations between the two tests reported in Table 1 all appear to be in 
the direction to be expected on the basis of the descriptions presented in the respec- 
tive manuals and from clinical experience as well. 

Coefficients of equivalence for the 16 factors of each form, with order of adminis-
tration controlled, are reported in Table 2. With Form A administered first, 9 of the
coefficients fall below .50, while only 2 of the factors (H and Q:) exceed .70. With
Form B administered first, 11 of the reported coefficients fall below .50, and
only one factor (H) exceeds .70. For the total sample, average Pearson coefficients
of correlation between the two forms indicate that they are not equivalent enough 
to be considered satisfactory alternate forms. Even when coefficients of equivalence 
are computed with the Spearman-Brown prophecy formula for determining the 
effects of doubling the length of the test, 6 of the reported coefficients still fall be- 
low .50, and only 5 of the factors (A, H, O, Q2 and Q,) exceed .70. These results sug- 
gest that for accurate individual diagnosis in clinical work, neither of the test forms
should ever be used alone, and further, that even when both tests are administered
together, as recommended by the authors in their manual, the reliability of factors 
B, C, E, I, L, M, N and Q; is still low. In this connection it should be noted that in 
Supplement A to the test manual “?, the test authors themselves report that 10 of the 
16 coefficients of equivalence for the 16 factors are less than .50, even after having 
been corrected by the Spearman-Brown formula to full length. 
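
The Spearman-Brown correction mentioned above can be written out as a short sketch. It uses the standard prophecy formula, with k = 2 for a test of doubled length. The function names are illustrative and the input coefficients are examples, not values from Table 2.

```python
def spearman_brown(r, k=2.0):
    """Predicted reliability when a test is lengthened by a factor k.

    Standard prophecy formula: r_k = k*r / (1 + (k - 1)*r).
    """
    denom = 1.0 + (k - 1.0) * r
    if denom == 0.0:
        raise ValueError("formula undefined for this combination of r and k")
    return k * r / denom


def required_single_form_r(target, k=2.0):
    """Single-form coefficient needed to reach `target` after lengthening by k.

    Obtained by solving the prophecy formula for r.
    """
    return target / (k - (k - 1.0) * target)


# Example: an equivalence coefficient of .50 between the two forms.
doubled = spearman_brown(0.50)              # about .67 for the combined forms

# Coefficients needed between forms to reach .70 or .50 at full length.
needed_for_70 = required_single_form_r(0.70)   # about .54
needed_for_50 = required_single_form_r(0.50)   # about .33
```

Read in reverse, a factor whose corrected coefficient is still below .50 must have had a form-to-form correlation below about .33.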

Having used the 16 P. F. Test as a part of the diagnostic psychological test 
battery in a clinical situation for over a year, it had been the authors’ contention 
that factor B (general intelligence) was actually not a valid measure of intelligence, 
at least with the intellectually superior sample of patients seen in the clinic. This 
hypothesis was capable of being tested since scores from both the Wechsler-Bellevue 
Intelligence Scale, Form I (W-B I) and Wechsler Adult Intelligence Scale (WAIS) 
were available on 50 of the 71 patients who had taken the 16 P. F. Test. The results 
presented in Table 3 show that factor B, on both Form A and B, does not correlate 
significantly with the averaged Wechsler scores. The Verbal IQ’s from the Wechsler 
tests were used in the computations in order to maximize the possibility of obtaining
significant relationships. Evidently factor B is not a valid measure of intelligence in
an intellectually superior sample.³


TABLE 3. CORRELATIONS BETWEEN FACTOR SCORES ON THE SIXTEEN P. F. TEST AND AVERAGED
W-B I AND WAIS VERBAL I.Q. SCORES (N = 50)


Sixteen P.F. Test      r* Verbal I.Q.           r* Verbal I.Q.
Factors†               with Form A Factors      with Form B Factors

A                           .12                      .01
B‡                          .16                      .19
C                           .03                     -.01
E                          -.07                      .05
F                           .21                      .14
G                          -.26                     -.05
H                          -.15                     -.05
I                           .03                     -.15
L                           .12                     -.16
M                           .06                  [illegible]
N                          -.19                  [illegible]
O                           .18                  [illegible]
Q1                          .09                  [illegible]
Q2                          .06                  [illegible]
Q3                         -.07                  [illegible]
Q4                         -.05                  [illegible]


*Pearson r must equal .28 to be significant at the 5% level of confidence.

†For these 50 cases order of administration on the Sixteen P. F. Test was alternated in order to balance
the effects of scale.

‡Factor B scores were also correlated separately for both forms with the following results: With Form
A given first, the Pearson r between Verbal I.Q. with factor B on Form A was .22, while the correlation
with factor B on Form B was .07. With Form B administered first, the Pearson r between Verbal
I.Q. with factor B on Form A was .13 and the correlation with factor B on Form B was .30. (N for
each group = 25).


As a further means of evaluating the construct validity of the 16 P. F. Test, each
of the scores on the remaining 15 personality factors on Forms A and B was also cor-
related with the averaged W-B I and WAIS Verbal IQ scores. The results, shown in
Table 3, demonstrate that none of the personality factors from either of the two forms
of the 16 P. F. Test correlates significantly with intelligence. These findings lend sup-
port to their construct validity as personality factors, and suggest that this test may
be of use in certain selection situations in which nonintellective factors are import-
antly related to criterion measurements.


³When the 16 P. F. Test (Forms A and B combined) and the short form of the Otis Self Adminis-
tering Test of Intelligence, Higher Examination, Form C were administered to a random sample of 96
USAF basic airmen, the following coefficients of correlation were obtained: r Otis I.Q. with
A = -.16, B = .49, C = .28, E = .13, F = .24, G = .12, H = .06, [values for I, L and M illegible],
N = .18, O = .24, Q1 = -.13, Q2 = .13, Q3 = [illegible] and Q4 = -.13. For significance at the 1% and
5% levels of confidence, Pearson r’s must equal .26 and .20.
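
The significance thresholds quoted for these correlations (.28 for N = 50, and .26 and .20 for N = 96) can be recovered from the usual t test for a Pearson coefficient, as in the sketch below. It assumes two-tailed tests with N - 2 degrees of freedom. The critical t values are hard-coded from standard tables rather than computed, and that choice is an assumption of the sketch.

```python
from math import sqrt

# Two-tailed critical t values from standard t tables, keyed by (df, alpha).
# Assumption: hard-coded to three decimals instead of computed from the t distribution.
T_CRIT = {
    (48, 0.05): 2.011,   # N = 50, 5% level
    (23, 0.05): 2.069,   # N = 25, 5% level
    (94, 0.05): 1.986,   # N = 96, 5% level
    (94, 0.01): 2.629,   # N = 96, 1% level
}


def critical_r(n, alpha):
    """Smallest |r| that is significant for a sample of size n.

    Inverts t = r * sqrt(df) / sqrt(1 - r**2) with df = n - 2.
    """
    df = n - 2
    t = T_CRIT[(df, alpha)]
    return t / sqrt(t * t + df)


r_n50 = critical_r(50, 0.05)      # the 50 cases with Wechsler scores
r_n25 = critical_r(25, 0.05)      # each order-of-administration subgroup
r_n96_05 = critical_r(96, 0.05)   # the 96 basic airmen, 5% level
r_n96_01 = critical_r(96, 0.01)   # the 96 basic airmen, 1% level
```

With only 25 cases per subgroup the threshold rises to roughly .40, so the subgroup correlation of .30 for factor B also falls short of significance.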

In order to determine whether experienced clinicians were capable of predicting 
which factors from the 16 P. F. Test would correlate significantly with each of the 
MMPI scales, five judges were used to make predictions. Two of the judges were 
psychiatrists and the other three were clinical psychologists. All of the judges had 
worked with the MMPI, while only 3 of the judges (the psychologists) had any ex- 
perience at all with the 16 P. F. Test. Each of the judges was given a folder which 
contained an MMPI manual, the manual for the 16 P. F. Test, a sheet of paper on
which each of the MMPI scales was listed, and a page of instructions which read
as follows:


“We are attempting to do a study which involves establishing the relation- 
ships between scales on the MMPI and those on the Sixteen Personality Factors 
Test having common psychological rationale. Your task involves taking the 
sheet of paper on which the fourteen (14) MMPI scales are listed and next to 
each of them putting the factors, or factor, from the 16 P. F. which you believe 
should be significantly related in either the positive or negative direction. Please 
indicate the direction you predict by placing a + or — after each factor. This 
should be done carefully, since on some of the scales of the 16 P. F. Test a high 
score does not have a negative connotation as it always does on the MMPI. 
Observe that on ten of the 16 factors (notably A, B, C, E, F, G, H, N, Q: and 
Q;) high scores are desirable, while on the remaining 6 factors (namely, I, L, M, 
O, Q: and Q;) low scores appear to be desirable. In every instance the direction 
you posit should be in terms of high scores on both the MMPI and 16 P. F. To
facilitate this task you are being provided with a manual for each of the two
tests.” 


The results presented in Table 4 demonstrate that none of the individual judges 
was able to predict a greater number of significant relationships than could be ex- 
pected on a chance basis. Moreover, even when the selections were limited to those 


TABLE 4. CHI SQUARE VALUES FOR EACH JUDGE

                                            Judges*
                                   A      B      C      D      E
Number of correct predictions     13      8     12     15     20
Total number of predictions       29     13     20     33     55
Chi Square**                    0.41   2.50   0.88   0.35   0.02

*The selections of each judge were compared with the actual number of significant correlations
out of the total number of correlations computed between the two tests, i.e., 87 out of 224.
**With d.f. = 1, chi square must equal 3.84 for significance at the .05 level of confidence.


predictions which showed 3 or more agreements among the 5 judges
(10 correct out of 16 agreements), the null hypothesis could not be rejected (χ² =
1.60, P > .20). These findings indicate the great need for further research on


problems which give consideration to the clinician as well as to his measuring instru- 
ments “* 17), 


DISCUSSION


Special consideration should be given to certain relationships between the 16 
P. F. Test and the MMPI which were expected to occur, but which did not. All five 
of the judges were in agreement in predicting significant correlations between D and 
factor F (depressive anxiety), Pt and factor C (ego strength), Ma and factor F, Pa 
and factor L (paranoid schizothymia), and Sc and factor A (cyclothymia). Only the
first three of these relationships were realized in the matrix of intercorrelations given
in Table 1 while the last two were not. Four of the five judges predicted significant 
relationships between D and factor O (anxious insecurity), Pd and factors G (super- 
ego strength) and M (hysteric unconcern), and Mf and factor I (emotional sensitiv- 
ity). All of the correlations predicted were realized except for that between Pd and 
G. Finally, three of the five judges concurred in predicting significant relationships 
between Hs and factor Q4 (nervous tension), Hy and factors M (hysteric unconcern)
and C (ego strength), Sc and factor H (adventurous resilience), Ma and factor A 
(cyclothymia), and Si and factors A and H. Of these seven predictions, four were 
correct, while the three between Hy and M, Hy and C, and Ma and A were not 
actualized. As Cronbach and Meehl have indicated, when “a predicted relation
fails to occur, the fault may lie in the proposed interpretation of the test or in
the network.” Whatever the case, these findings indicate that some alteration in the
“nomological network” of certain MMPI scales (Pa, Sc, Pd, Hy and Ma) and/or 
particular 16 P. F. factors (L, A, G, M and C) is necessary. It can, however, be stated 
that the present manner in which these scales and factors are described in their 
respective manuals does not enable experienced clinicians to make a significantly 
greater number of correct predictions about the relationships which exist among 
these measures than can be expected on a chance basis. 

Some mention should be made of the possibility of developing measures of sub- 
jects’ test-taking attitudes on the 16 P. F. Test. The findings presented in Table 1 
suggest that there are enough factors on the 16 P. F. Test which correlate significant- 
ly with the L, F and K scales of the MMPI to make such an undertaking a distinct 
possibility. This is regarded as an important problem in clinical work since it is pre- 
cisely with those patients who are reluctant to provide information about themselves 
that the busy clinician frequently turns to the personality tests for help. That the 
test authors are cognizant of some of the shortcomings of personality questionnaires 
when administered to subjects whose honesty, insight and cooperation are poor can 
be inferred from their statement that “results differ as between cooperative and
uncooperative, well educated and poorly educated, honest and ulteriorly-motivated
subjects” (p. 2).


SUMMARY AND CONCLUSIONS 


Some aspects of the construct validity of the Sixteen Personality Factors Test 
were investigated by means of correlating its factors with the MMPI scales and with 
averaged Wechsler-Bellevue Intelligence Scale, Form I, and Wechsler Adult Intelligence
Scale Verbal I.Q.’s. Coefficients of equivalence for each of the factors on
the 16 P. F. Test were also reported. In addition experienced clinicians were asked 
to make predictions about which factors from the Sixteen P. F. Test would be signi- 
ficantly correlated with each of the scales of the MMPI. 

It is concluded that (a) both the 16 P. F. Test and the MMPI measure certain 
common areas of personality; (b) the 16 P. F. Test has certain advantages over the 
MMPI, inasmuch as it contains fewer items, but apparently possesses greater 
breadth; (c) the coefficients of equivalence of the factors on form A and B are not 
high enough to warrant their being used as equivalent forms; (d) even when the co- 
efficients of equivalence are corrected for full length by means of the Spearman- 
Brown prophecy formula, the resultant coefficients do not appear high enough to 
warrant the use of this test for accurate individual clinical diagnosis; (e) factor B is 
not a valid measure of intelligence in an intellectually superior sample of military 
flyers and appears to require redefinition; (f) some support for the construct validity 
of the other 15 factors (not including B) as measures of personality was obtained 
from the fact that none of these factors were significantly correlated with individual 
measures of intelligence; (g) certain scales on the MMPI (Pa, Sc, Pd, Hy and Ma) 
and/or particular factors on the 16 P. F. (L, A, G, M and C) appear to require re- 
definition, since in their present form experienced psychologists and psychiatrists 
are misled into making incorrect predictions about their interrelationships. 
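
Conclusion (d) rests on the Spearman-Brown prophecy formula, which estimates the reliability of a test lengthened by a factor k from the coefficient r obtained for the original length. The sketch below is offered only as an illustration: the function name and the coefficient of .50 are assumptions made for the example and are not values reported in this study. Pooling forms A and B doubles the length of the test, so k = 2.

```python
def spearman_brown(r: float, k: float = 2.0) -> float:
    """Spearman-Brown prophecy: reliability of a test lengthened k-fold.

    r is the observed coefficient (here, the equivalence of forms A and B);
    k = 2 corresponds to pooling the two forms into one full-length test.
    """
    if not 0.0 <= r <= 1.0:
        raise ValueError("r must lie between 0 and 1")
    if k <= 0:
        raise ValueError("k must be positive")
    return k * r / (1.0 + (k - 1.0) * r)


# Hypothetical coefficient of equivalence, chosen only for illustration.
print(round(spearman_brown(0.50), 2))
```

A coefficient that is modest at single-form length remains well short of unity after the correction, which is the sense of conclusion (d).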


CONFIRMATION OF NINE PSYCHOTIC SYMPTOM PATTERNS


MAURICE LORR JAMES P. O’CONNOR JOHN W. STAFFORD


Veterans Administration and Catholic University 
Catholic University 


INTRODUCTION 


The analysis to be presented is prompted by the belief that identification of the 
basic symptom patterns of the functional disorders represents an important and 
possibly necessary initial phase of research in this area. Once the basic dimensions of 
illness are isolated and quantitatively appraised, more fundamental research into the 
origin and dynamics of pathological and deviant behavior can proceed with greater 
certainty. 

Dissatisfaction with current classificatory schemes is widespread. Initially 
achieved through acute clinical observation, a form of armchair factor analysis, 
diagnostic classifications are now periodically modified and ‘‘validated”’ by majority 
vote of practicing members of professional associations. This unsatisfactory state is 
in part due to the absence of specific known pathogenic organisms, characteristic 
organic brain lesions, or particular biochemical processes to account for mental ill- 
ness. Vague definitions without upper and lower bounds are also partly responsible. 
However, much of the disagreement between experienced judges and the dissatis- 
faction in practice is accountable to the very considerable overlap of symptoms from 
one diagnostic category to the next. As a consequence a large number of mixed
types are encountered in practice. 

In the domain of the functional psychoses five major studies have thus far been
reported. Using Moore’s data, Degan isolated some nine correlated first order
and four second order factors. Wittenborn and his collaborators have repeatedly
isolated some seven independent patterns. Lorr, Jenkins, and O’Connor identified
eleven first order and three second order factors. Recently Trouton and Maxwell
isolated six factors in a hospital Item Sheet. Guertin from an unrotated
cluster analysis of a check sheet identified five correlated factors for a sample of 
schizophrenics. Clearly there is need for cross-identification of the factors isolated 
and for repetition of such studies to determine the degree of invariance of the para- 
meters postulated. Different sets of rating scales and check lists have been used to 
quantify behavior descriptions and different patient populations have been studied. 
Wittenborn et al and Trouton and Maxwell used orthogonal rotation; others have 
rotated to oblique simple structure. As a consequence there is question as to the 
number of unique factors and their relationships. 

Accordingly in this study we sought to verify by means of multiple factor an- 
alysis 13 symptom and behavior dimensions previously identified as characteristic of 
functional psychotics. The hypothesized factors and the scale code numbers of the 
defining variables were as follows: (A) retarded depression vs. manic excitement: 1, 6, 
11, 32, 41, 43, 50, 53; (B) resistiveness: 31, 35, 36, 46; (C) paranoid projection: 3, 13, 
19, 42, 45, 52; (D) activity level: 28, 33, 37; (E) melancholic agitation: 2, 7, 15, 40, 48; 
(F) perceptual distortion: 8, 16, 24; (G) motor disturbances: 9, 12, 17; (H) hostile belligerence:
18, 29, 34; (I) withdrawal vs. sociability: 30, 38, 39, 54; (J) grandiose expansiveness:
14, 20, 25, 44, 49; (K) conceptual disorganization: 5, 10, 21, 51; (L) obsessive-phobic
reaction: 22, 26, 47, 55; (M) hysteric conversion: 4, 23, 27.
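
The assignment of defining scales to hypothesized factors can be checked mechanically. The sketch below transcribes the code numbers listed above into a Python dictionary and verifies that every one of the 55 scales defines exactly one factor; the names HYPOTHESIZED and check_partition are illustrative assumptions, not part of the original analysis.

```python
# Scale code numbers for the 13 hypothesized factors, as listed in the text.
HYPOTHESIZED = {
    "A": [1, 6, 11, 32, 41, 43, 50, 53],  # retarded depression vs. manic excitement
    "B": [31, 35, 36, 46],                # resistiveness
    "C": [3, 13, 19, 42, 45, 52],         # paranoid projection
    "D": [28, 33, 37],                    # activity level
    "E": [2, 7, 15, 40, 48],              # melancholic agitation
    "F": [8, 16, 24],                     # perceptual distortion
    "G": [9, 12, 17],                     # motor disturbances
    "H": [18, 29, 34],                    # hostile belligerence
    "I": [30, 38, 39, 54],                # withdrawal vs. sociability
    "J": [14, 20, 25, 44, 49],            # grandiose expansiveness
    "K": [5, 10, 21, 51],                 # conceptual disorganization
    "L": [22, 26, 47, 55],                # obsessive-phobic reaction
    "M": [4, 23, 27],                     # hysteric conversion
}


def check_partition(factors, n_scales=55):
    """Return (missing, duplicated) scale codes for a factor-to-scales mapping."""
    seen = [code for codes in factors.values() for code in codes]
    missing = sorted(set(range(1, n_scales + 1)) - set(seen))
    duplicated = sorted({code for code in seen if seen.count(code) > 1})
    return missing, duplicated


missing, duplicated = check_partition(HYPOTHESIZED)
print("missing:", missing, "duplicated:", duplicated)
```

Both lists come back empty, so the thirteen hypothesized factors partition the full schedule of 55 scales with no scale assigned twice.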


PROCEDURE 


The total sample consisted of 116 functional psychotics from eight psychiatric 
hospitals. The sample from each hospital included open and closed ward cases, 


chronic and acutely disturbed cases, newly admitted and continued treatment cases. 
Alcoholics and neurological cases were excluded. 


Ratings were made on a schedule of 55 scales, 39 of which were taken from the 
Multidimensional Scale for Rating Psychiatric Patients and 16 from the Wittenborn 
Psychiatric Rating Scales. These are the two best known standardized rating sched- 
ules. It seemed worthwhile to include representative scales from each form in order 
to determine their equivalence. Of the 39 MSRPP scales, 27 were taken from the 
interview section and 12 from the ward observation section. Variables numbered 1 
to 39 are MSRPP scales; those coded 40 through 55 are Wittenborn scales. Both the 
MSRPP and Wittenborn scales were preceded by the full set of instructions provided 
by the authors. 

Each rater completed the interview sections following a relatively thorough 
interview with the patient. The ward behavior section was completed either by an 
experienced attendant or nurse who knew the patient well, or by the interviewer on 
the basis of the report of the attendant or nurse. 

Product moment correlations were computed between 52 of the scale variables. 
Three variables were dropped because a sizable number of patients were judged un- 
ratable. Since two of the variables omitted were introduced to define the hysteric 
conversion factor, the hysteric cluster could not be included in the study. When the 
correlations were segregated into 12 clusters, the cluster labelled hostile aggressive- 
ness (H) was found to merge with the manic excitement cluster (A). The manic 
cluster and the 10 remaining clusters of correlations formed the basis for extracting 
11 factors by the multiple group method. The resulting oblique matrix was ortho- 
gonalized and rotated by the single plane method to approximate simple structure. 
One rotated factor is considered residual and will not be described. Except where the 
pattern of correlations strongly suggests a lower limit, a cutting point of +.35 will 
be used in interpreting the factors. 

RESULTS

The scales correlated +.35 or higher with Factor A are as follows: 


Scales                                                                          Correlation
swears less than others - swears more than others                              [illegible]
failure of affective responses - affective responses appropriate               [illegible]
no evidence of loudness of voice - raises voice                                [illegible]
rarely irritable - more irritable than most                                    [illegible]
responses slowed or delayed - no slowing of responses                          [illegible]
speech almost inaudible - speech distinctly loud                               [illegible]
conspicuously restrained in feeling - conspicuously unrestrained in feeling    [illegible]
rate of speech uniform - rate of speech variable                               [illegible]
taciturn or mute - conspicuously over-talkative                                [illegible]
deeply depressed - highly elated                                               [illegible]
placid or relaxed - distinctly tense                                           [illegible]
neat and orderly - slovenly and unkempt                                        [illegible]
bears no hostility - highly hostile                                            [illegible]



Factor A represents a bipolar parameter; a retarded depression is at one pole and a 
manic excitement is at the other. A is very similar to Wittenborn’s manic-depressed 
cluster. It is broader than the original Factor A since it includes the irritability,
obscenity, and hostility of the hostile belligerence cluster. One pole seems to be identical
with Degan’s second order mania factor W which is defined by hyper-irritability
and manic hyper-excitability. It appears that Factor A is a second order factor
appearing at the first order level. 
The scales correlated +.35 or higher with Factor B are as follows: 

No.    Scales                                                                           Correlation
[...]  does what is expected without urging - requires urging to do what is expected   [...]
[...]  reasonably cooperative - often reluctant to cooperate                           [...]
31     never resists expected activities - often resists expected activities           .36


Factor B is identical with the resistiveness factor previously identified. Degan’s
neurasthenia factor I, defined by neurasthenia (appearance of fatigue) and
negativism, bears considerable resemblance to B. The underlying element appears 
to be a stubborn reluctance to conform to or comply with the directions of others 
which is particularly characteristic of schizophrenic patients. A resistance to any 
form of energy expenditure may explain the presence of this factor in depressive 
patients. 

The scales correlated +.30 or higher with Factor C are as follows:


Scales                                                                   Correlation
no unjustified suspicions - believes people talk about him              [illegible]
no unjustified suspicions - believes he is conspired against            [illegible]
does not feel misused - feels conspired against                         [illegible]
bears no hostility - highly hostile                                     [illegible]
no evidence he feels controlled - believes others influence his behavior [illegible]
thinking not bizarre or obscure - thinking bizarre or obscure           [illegible]
rarely irritable - more irritable than most                             [illegible]
no hostile impulses - disturbed by hostile impulses                     [illegible]


Factor C represents a paranoid type of projection clearly identical with Factor C of
the original study, with Wittenborn’s paranoid schizophrenia, and with Guertin’s
persecutory suspicious factor. However, the hostility and the disturbance by
impulses to commit hostile acts were not present in the original C. Degan failed to 
find such a factor in Moore’s data for the simple reason that no variables defining the 
paranoid syndrome were explicitly included. 

The scales correlated +.30 or higher with and defining Factor E are as follows: 
Scales                                                               Correlation
no complaint of anxiety - distressed by anxiety                      [...]
no hostile impulses - disturbed by hostile impulses                  .49
no morbid fears - disrupted by morbid fears                          .48
no obsessional thinking - obsessional thinking                       .50
unconcerned - distinctly anxious                                     .46
no feelings of sinfulness - conviction of sinfulness                 .43
no complaint of foreboding - feelings of impending evil              .35
no use of complaints - frequent use of complaints                    .31
seems completely worn out - livelier than others                     -.32
does not feel misused - feels conspired against                      .31
no manneristic movements - frequent manneristic movements            .31
deeply depressed - highly elated                                     .30
knows who he is and others - knows neither self nor others           .36
thoughts and feelings consistent - thoughts and feelings unrelated   .40


The parameter underlying E is a melancholy agitation. It is identical with 
Factor E of the original study, with the addition of morbid fears and obsessions.
Wittenborn’s anxiety factor is probably the same as E. Factor E is very similar to 
Degan’s depression factor which is defined by depression, anxiety, refusal of food, 
tearfulness, and retardation of speech action. Only the retarded element in Degan’s 
depression factor is inconsistent with the hypothesis of an anxious depression. The 
negative correlations of thought-feeling disharmony and manneristic movements 
with E were also present in the original E. These suggest that Factor E is actually 
bipolar, the negative end defining a disassociated lack of concern or apathy. 

The scales correlated +.35 or higher with Factor F are as follows: 


No.  Scales                                                        Correlation
4    reports no visions - sees and believes in visions             .66
8    reports no voices - hears and believes in voices              .62
9    knows who he is and others - knows neither self nor others    .46


Factor F was previously described as representing a pathological tendency towards 
perceptual distortion of reality. It seems identical with Degan’s hyper-projection 
factor. Three conditions may have prevented its appearance in Wittenborn’s data: 
orthogonal rotations, an insufficient number of reference variables, and incomplete 
reduction of the correlation matrix. 

The scales correlated +.35 or higher with Factor G are as follows: 


Scales                                                     Correlation
no peculiar movements - frequent manneristic movements     .53
no bizarre postures - persistent bizarre postures          .49
neat and orderly - slovenly and unkempt                    .37
unconcerned - distinctly anxious                           .37


Factor G is essentially the same as the motor disturbance syndrome G previously 
isolated. Except for the absence of mutism and refusal of food, Factor G is the same 
as Degan’s catatonia. Inasmuch as few mute, stuporous patients were included in
the present study, it is not surprising that these elements are not represented on G.
Wittenborn did not include scales defining manneristic postures or movements in his 
analyses. Motor disturbances, as exhibited in bizarre and repeated movements and 
postures, have been produced experimentally in animals and in humans by means of 
drugs. It is difficult to account for such disturbances at present, except as indicators 
of personality disorganization. 
The scales correlated +.35 or more with Factor I are as follows: 


Scales                                                       Correlation
interested in nothing - many interests                       .66
appears worn out - livelier than most                        .63
actions slower than average - actions faster than average    .59
underactive - restless                                       .55
no answer if spoken to - hard to [...] if spoken to          .39
stays by himself - usually with others                       387


Factor I appears to be a combination of previously isolated factors of social 
withdrawal versus sociability (I) and activity level (D). All elements of the original 
withdrawal factor except neatness are represented here. Similarly, every element 
defining activity level except distractiveness is represented on the new Factor I. 
Guertin’s retardation and withdrawal factor is virtually identical with Factor I. 
Our hypothesis is that Factor I is a bipolar parameter of social withdrawal vs. excessive
sociability. Further study with a larger number of cases should aid in testing this
hypothesis.

The scales correlated +.30 or higher with Factor J are as follows:


No.  Scales                                                   Correlation
5    no exaggerated opinion of self - grandiose convictions   .46
14   self depreciatory - self important or conceited          .30
34   rarely irritable - more irritable than most              -.31


Factor J represents an underlying self depreciation vs. grandiose expansiveness identi- 
cal with the previously isolated Factor J. It is poorly defined in the present study but 
its meaning seems clear. In Wittenborn et al, this factor is called paranoid condition 
and is characterized by busyness, optimism, grandiose notions, and stilted speech. 
The per cent of total variance this factor accounts for tends to be small in all studies 
thus far reported. 

The scales correlated +.35 or higher with Factor K are as follows: 


No. Scales Correlation 
5 speech responses relevant — speech responses irrelevant .69 

21 speech coherent — speech scattered and incoherent .62 

43 ideas change normally — rapid or abrupt change of ideas .53 


Factor K is indicative of thinking disorganization. It appears to be the same as the 
factor called conceptual disorganization in the earlier study. It was hypothesized
that thought and feeling disharmony would help define this factor; this does not 
occur. A very similar syndrome is called schizophrenic dissociation by Degan be- 
cause of specific disturbances in the association of ideas and incongruity of affective 
responses. Guertin’s personality disorganization factor is very similar to Factor K. 
In a recent yet unpublished study, a rerotation of Wittenborn and Holzberg data 
by one of the authors also revealed a conceptual disorganization factor. It was de- 
fined as irrelevant and incoherent speech, memory faults, variation in rate of speech, 
unawareness of the feelings of others, and inability to make simple decisions. 
The scales correlated +.35 or higher with Factor L are as follows: 


No.  Scales                                                          Correlation
55   no compulsive acts - compulsive acts occur daily                .55
26   no compulsive acts - resists performance of compulsive acts     .43


On the basis of the Wittenborn-Holzberg findings, a phobic-compulsive syndrome 
was hypothesized. Since Factor L is defined only by two very similar scales it may be 
that it is simply a doublet. Compulsive acts are not commonly found in psychotics. 
In fact a phobic-compulsive syndrome is ordinarily regarded as characteristic of 
neurotics. If few or no severe neurotics were included in the present sample studied 
we would not expect a phobic-compulsive syndrome. Furthermore, it is not un- 
common for raters to confuse compulsive acts with manneristic stereotyped move- 
ments. Perhaps this is why compulsive acts helped define a catatonic excitement in 
the original Wittenborn analysis. It may be that similar conditions resulted in
Factor L in the present study. 


Second Order Analysis. The correlations between the primary vectors were clustered 
and two factors were extracted simultaneously. The orthogonalized matrix was then 
rotated. Second order Factor X, shown in Table 1, evidently represents a manic 
excitement vs. retarded withdrawal with motor disturbances. A related factor, CC, 
identified as agitated hyperactivity vs. apathetic confused withdrawal was isolated 
in an earlier study.

Factor Y is defined at one end by grandiose expansiveness, paranoid projection, 
resistiveness, and a lack of concern. Self depreciation, a conforming cooperativeness, 
and a depressive agitation characterize the opposite pole. A similar paranoid- 
depressive Factor Y was identified by Degan. Only the grandiose element is missing 
from Degan’s second order factor. The underlying psychological mechanism seems 
to be extropunitiveness vs. intropunitiveness. The patient either blames others for 
his difficulties or blames himself. 

DISCUSSION

Of the 13 syndromes hypothesized, 9 appear to have been confirmed. Hysteric 
conversion did not appear because two of the three defining variables could not be 
included in the study. Instead of a phobic-compulsive syndrome, a compulsive 
doublet emerged from the rotations. Activity level combined with social withdrawal, 
while hostile belligerence combined with manic excitement. Since the latter has also 
been identified by Degan as hyper-irritability, and by Guertin as excitement-hostil- 
ity, it seems likely that some restriction in the nature and size of the sample ac- 
counts for its disappearance in this analysis. 


TABLE 1. ROTATED SECOND ORDER FACTOR MATRIX
(Decimal points are omitted.)

                                                      Second Order Factors
First Order Factors                                       X        Y
Retarded depression - manic excitement                    54
Cooperativeness - resistiveness                          -13
No projection - paranoid projection                      -21
No agitation - melancholy agitation                       00
No distortion - perceptual distortion                     05
No disturbances - motor disturbances                     -52
Withdrawal - sociability
Self-depreciation - grandiosity
No disorganization - conceptual disorganization
No conversion - hysteric conversion
Residual                                                  66








Each of the factors confirmed has now appeared in at least three independent 
analyses. Different rating schedules or check lists were used, and substantially 
different patient samples were studied. This suggests that the underlying para- 
meters hypothesized may be used with considerable confidence. They represent use- 
ful constructs for describing and studying psychiatric patients. 


SUMMARY 

This study sought to verify by means of multiple factor analysis 13 symptom 
dimensions previously identified as characteristic of functional psychotics. A sample 
of 116 psychotics from eight hospitals was described on 55 rating scales taken from
two well known rating schedules. The scales were intercorrelated and factored by the 
multiple group method. Of the 13 syndromes hypothesized, 9 were confirmed with 
only minor modifications. Two second order factors extracted were also found to 
resemble factors previously isolated. 


THE LOYOLA LANGUAGE STUDY*
VINCENT V. HERR 
Loyola University, Chicago 


PROBLEM 


The Loyola Language Study (LLS) is a modified form of the Kent-Rosanoff 
Association Test. Although free and controlled association tests have long been 
regarded as being among the most important instruments for measuring personality, 
very little actual research evidence has accumulated concerning their reliability and 
validity. Rosanoff et al. published the largest amount of data on the Kent-Rosanoff
Association Test including tables of frequencies of popular and rare responses.
Unfortunately, associative responses are largely determined by local and cultural
meanings so that Rosanoff’s frequency tables are of questionable value and validity
for contemporary American groups. Malamud conducted validity studies on
Maller’s controlled association test, and concluded that both validity and reliability
were satisfactory, even for individual diagnosis, when proper systems of weights are
used. Goodenough recommended such tests for personality studies, and Crown
used the Maller test for distinguishing neurotics from normals. Our purpose has
been to bring the association test up-to-date for current usage. 

The problem in the present investigation was to ascertain whether the com- 
munality of the associations of hospitalized patients differed significantly from that 
of normals on the LLS. Specifically, schizophrenic responses would be expected to 
differ from normal in such a controlled test. Earlier investigations concentrated 
upon the reaction time for free association, and on the singularity of the responses 
which the subjects gave. The present investigation concentrates upon the fact of 
deviation from communality of response, and attempts to quantify these deviations. 
An essential prerequisite was to devise a satisfactory system of scoring communality 
of response without which small differences might be obscured. 


METHODS 


Test. The Loyola Language Study (Fig. 1) consists of eighty words chosen at 
random from the Kent-Rosanoff list in order to shorten the testing time to 30 min- 
utes. The subjects were instructed to pause and consider what most people would be 
likely to think when confronted with each of the eighty stimulus words. The assump- 
tion was that psychotics in general, and schizophrenics in particular, would not be as 
capable of complying with this type of instruction and of finding common responses 
as the normals would. 


Subjects. Since the Kent-Rosanoff frequency lists for free association were based 
on samples from the eastern United States, it was decided that the Loyola Language 
Study should be standardized separately for different areas. The present investiga- 
tion began with normal and abnormal samples drawn from the Boston metropolitan 
area, was repeated in the Chicago area, and is projected for other areas in the United 
States. 

Because age and education might affect the quality of the responses, a stratified 
random sampling technique was used in establishing norms for the general popula- 
tion. For the Boston experiment, the census statistics for the Boston metropolitan 


*This project was sponsored in part by a grant from the Department of Welfare of the State of 
Illinois. 

[Fig. 1. Stimulus words of the Loyola Language Study]

soldier (F)      hungry           butterfly        long
head (M, F)      anger            afraid           fruit
dark (M)         red              [...]r           loud (F)
bath             high (F)         white            command
sour             king (M)         [...]eep         sleep (F)
black (M)        hammer (F)       table (M, F)     thirsty (M, F)
quiet            working          comfort          soft (M)
short            beautiful        cold (M)         whiskey
yellow           window (M, F)    scissors (M)     foot (M, F)
doctor (M, F)    sickness (M)     mountain (M, F)  stove (F)
girl             salt             man              cheese
baby             moon (F)         spider           bread (M, F)
whistle (M, F)   dream (F)        trouble (F)      religion (F)
street           health           ocean            bed (F)
child            tobacco (M, F)   woman            cabbage
citizen (F)      hard             wish (M)         carpet
earth            blue             house (F)        needle (M)
lion             sweet (M)        justice          hand (M)
butter (M)       stomach (M, F)   river            thief (M)
music





area for 1950 were consulted; counts were made of the numbers of persons in the general
population with education at each possible level from fifth grade to sixteen years
of education, and at each age level from nineteen years to sixty. Proportional num- 
bers were used for the age and education sub-groupings in this study. Similarly, 
census statistics for the Chicago metropolitan area for 1950 were consulted; propor- 
tional numbers of subjects were chosen for the cells in the Chicago study. Four 
hundred males and 400 females were sufficient for a twelve-celled distribution, with 
four equal divisions with respect to education and three equal divisions with respect 
to age. However, since the frequencies of males differed from those of the females in 
any one particular cell, it was necessary to proceed with the two sexes separately in 
each area. Only persons who had been reared in the United States were included in 
the normative samples. 
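
The proportional sampling described above can be sketched as follows. This is only an illustration under stated assumptions: the census counts, the band labels, and the largest-remainder rounding rule are hypothetical choices made for the example, since the actual 1950 figures and the rounding procedure are not given in the text. The sketch distributes 400 subjects over the twelve education-by-age cells in proportion to the population of each cell.

```python
from itertools import product


def proportional_allocation(population, sample_size):
    """Allocate sample_size across cells in proportion to population counts.

    Largest-remainder rounding is used so that the quotas sum exactly
    to sample_size (an assumed rule, not one stated in the study).
    """
    total = sum(population.values())
    if total <= 0:
        raise ValueError("population counts must sum to a positive number")
    exact = {cell: sample_size * n / total for cell, n in population.items()}
    quota = {cell: int(x) for cell, x in exact.items()}  # round down first
    shortfall = sample_size - sum(quota.values())
    # Hand the remaining places to the cells with the largest fractional parts.
    by_remainder = sorted(exact, key=lambda c: exact[c] - quota[c], reverse=True)
    for cell in by_remainder[:shortfall]:
        quota[cell] += 1
    return quota


# Hypothetical bands and counts (in thousands); not the 1950 census figures.
EDUCATION = ["edu_1", "edu_2", "edu_3", "edu_4"]
AGE = ["age_1", "age_2", "age_3"]
HYPOTHETICAL_COUNTS = [52, 61, 47, 88, 95, 70, 66, 73, 58, 31, 36, 23]
CENSUS = dict(zip(product(EDUCATION, AGE), HYPOTHETICAL_COUNTS))

quotas = proportional_allocation(CENSUS, 400)
for cell, n in quotas.items():
    print(cell, n)
```

Because the male and female census frequencies differ cell by cell, such an allocation would be run once for each sex in each area, as the text indicates.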

The hospitalized psychotic patients to be investigated in each geographical 
area were matched, person by person, with normals from the same area with regard 
to both age and education, because the possible effects of age and education were not 
known at the start. Patients were tested individually whereas normals were tested in 
groups of twenty to thirty. The psychiatric patients in the Boston area were in- 
mates of a state hospital who had been diagnosed as predominantly schizophrenic. 
Seventeen percent of them were manics and two percent involutionals. The patients 
from the Chicago mental hospital had all been diagnosed as schizophrenics. In 
Boston, 50 male and 50 female patients served as subjects; in Chicago 56 males and 
53 females. None of the patients had undergone shock treatment within three months 


prior to testing.


SYSTEMS OF SCORING

Systems of Scoring. Carter and Sorensen have already studied the problem
of scoring free and controlled association responses, and have concluded that
insignificant differences result from different methods of scoring. Three systems of
scoring were employed in the present investigation. 

In establishing norms for the general population, Boston males, Boston females, 
Chicago males, and Chicago females were treated as four separate categories. A 
frequency count was made, for instance, of the responses of the 400 Boston males to 
each of the 80 stimulus words. Thus it was established that, for example, 169 of the 
400 Boston males responded to the word “hammer” with the word “nail”. Per- 
centages were then determined for each response to each stimulus word, and dis- 
tributions of percentages were reduced by taking the square root of each. This pro- 
cedure aligned all the responses of the normative sample on a descending scale, ac- 
cording to the amount of agreement among the normative group in choosing a parti- 
cular response. The resultant psychological continuum, from low to high, based 
upon agreement of choices among the normative subjects, is the same kind of con-
tinuum involved in previous studies on communality of association response. After
the mean root frequency and the standard deviation for each stimulus-response dis- 
tribution had been calculated, standard scores were computed, with an arbitrary 
mean set at 20 and a standard deviation at 10. This method was thought to be justi- 
fied because all the distributions were of the same shape and were skewed in the same 
direction. It had the advantages of an equal-interval scale, and independence of 
means and variances. 
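The standard-score procedure described above can be sketched for a single stimulus word as follows. Only the figure of 169 of 400 Boston males answering "nail" to "hammer" comes from the text; the remaining counts, the function name, and the choice of the population form of the standard deviation are assumptions made for illustration, since the paper does not specify them.

```python
import math
from statistics import mean, pstdev

def standard_scores(counts, n_subjects, target_mean=20.0, target_sd=10.0):
    """Convert response counts for one stimulus word into standard scores."""
    # Percentage of the normative group giving each response.
    pct = {resp: 100.0 * c / n_subjects for resp, c in counts.items()}
    # Square-root transform, as described in the text.
    roots = {resp: math.sqrt(p) for resp, p in pct.items()}
    mu = mean(roots.values())
    # Assumption: population SD; the paper does not say which form was used.
    sigma = pstdev(roots.values())
    # Rescale to the arbitrary mean of 20 and standard deviation of 10.
    return {resp: target_mean + target_sd * (r - mu) / sigma
            for resp, r in roots.items()}

# Hypothetical counts: only "nail" = 169 of 400 is taken from the text.
hammer_counts = {"nail": 169, "tool": 90, "saw": 60, "hit": 41, "wood": 40}
scores = standard_scores(hammer_counts, n_subjects=400)
for resp, s in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{resp:5s} {s:6.2f}")
```

A subject's total score would then be the sum of the scale values of his 80 responses, each looked up in the scale for its own stimulus word.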

The final goal of scoring a test of communality must be to secure a single score 
for each subject for all his responses, a total score which may be taken as an indicator 
of the success that he has on repeated trials in finding common responses. Conse- 
quently, after the procedure described above had been carried out, test booklets of 
the 400 individuals in each of the four categories of subjects, namely the Boston men 
and women, the Chicago men and women, were scored according to the standard 
score scales determined from the frequency counts. 

Although the standard score method just described had its advantages, it was 
costly in time and effort. A simpler method of scoring was also employed. This 
simpler method utilized double-root percentage-frequency values, so that the 
resultant scale values gave to each response to any given stimulus a value absolutely 
determined by its raw frequency. Thus no account was taken of the relationship 
between any given frequency and the other frequencies in the same distribution; no 
account was taken of the varying top frequencies from stimulus to stimulus. The 
Chicago samples of 800 normals and 109 patients were rescored by this method. A 
difficulty arose from the use of the absolute root-frequency scales. The variances 
turned out to be proportional to the means. This would preclude the use of certain 
statistical procedures with the data. 

A still simpler type of scoring was attempted with the same 800 Chicago normals 
and 109 Chicago patients. Starting with the response of highest frequency to any 
given stimulus, as many other responses (in descending frequency) were accumulated 
to the highest response, as would be necessary to include the responses of 50% of the 
population. Any subject who chose a response that fell within the upper 50% of the 
normative sample was credited with a score of 1 for that item; any who did not 
choose a word from among them received a score of zero. For example, two res-
ponses to the stimulus “house”, namely “home” and “family”, together netted 51%
of the total female normative group. Choice of either of these responses would merit
a score of 1, and choice of any other would merit zero. When this method of “median”
scores was used in scoring the eighty items, some patients scored near zero, and since 
it was planned to shorten the test later, it was not deemed advisable to work with 
the median scores. Actually the correlation between the standard scores and the 
doubled-root scores, for the same 800 subjects, was .96 and the correlation between 
the standard scores and the median scores was .93. Any of the three systems of 
scoring yielded differences between patients and normals which were significant be- 
yond the .001 level of confidence. Yet somewhat different cutting points between 
patients and normals resulted if one system was used rather than another. The dis- 
cussion of results which follows is based upon the standard scores only. 
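The "median" scoring rule lends itself to a brief sketch. The text states only that "home" and "family" together accounted for 51% of the female normative group; the split between those two responses, the other responses, and the function names below are hypothetical.

```python
def median_key(counts, n_subjects, cutoff=0.5):
    """Top responses, in descending frequency, needed to cover `cutoff` of the group."""
    key, covered = set(), 0
    for resp, c in sorted(counts.items(), key=lambda kv: kv[1], reverse=True):
        if covered >= cutoff * n_subjects:
            break  # the keyed responses already include half of the normative group
        key.add(resp)
        covered += c
    return key

def median_score(responses, keys):
    """One point for every item on which the subject's response is in the key."""
    return sum(1 for stim, resp in responses.items()
               if resp in keys.get(stim, set()))

# Hypothetical counts chosen so that "home" + "family" = 204 of 400 (51%).
house_counts = {"home": 120, "family": 84, "building": 70, "roof": 66, "door": 60}
keys = {"house": median_key(house_counts, n_subjects=400)}
print(keys["house"])
print(median_score({"house": "family"}, keys), median_score({"house": "roof"}, keys))
```

Because each item contributes only 0 or 1, a subject with few common responses can fall to a total near zero, which is the difficulty the text reports for some patients.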


RESULTS 


The test booklets of 800 Chicago normals and 800 Boston normals, of 100 Boston 
patients, and of 109 Chicago patients, were scored with the standard scores. Then 
53 Chicago female patients were matched for age and education with 53 normals 
selected from the 400 females; 56 Chicago male patients were matched with 56 
normal Chicago males; 50 Boston female patients with 50 Boston normal females; 
and 50 Boston male patients with 50 Boston normal males. The matching was 
person to person, and the limits of the differences allowed were one year of age and 
one year of education. The mean difference between normals and patients was in 
each case significant beyond the .001 level of confidence. But there was considerable 
overlap on the extremes. In order to ascertain which items contributed most to the 
differences, an item analysis was made on all 80 items, comparing the mean score
for each item made by the normals with that made by the patients. At least 25 items 
were significantly different at the .01 level of confidence for each of the four compari- 
sons. The booklets were then rescored using only the 25 items which showed the 
maximum difference. After computing new means and standard deviations, it be- 
came apparent that the shortened test had a much higher screening efficiency than 
the longer test. Fig. 1 contains the list of the 80 stimuli; the top 25 items for
Chicago men are marked with M, those for the Chicago women with F. Only eleven
of the top items are common to both Chicago men and women. The same is true for 
the Boston men and women. Just half of the top screening stimuli are common to any 
one sex, regardless of the area of residence. 
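The person-to-person matching described at the start of this section, with limits of one year of age and one year of education, might be sketched as below. The greedy search order, the record layout, and the sample values are assumptions for illustration; the paper does not describe how candidate normals were chosen when several qualified.

```python
def match_pairs(patients, normals, max_age_diff=1, max_edu_diff=1):
    """Pair each patient with the first unused normal inside the allowed limits."""
    unused = list(normals)
    pairs = []
    for p in patients:
        for n in unused:
            if (abs(p["age"] - n["age"]) <= max_age_diff
                    and abs(p["edu"] - n["edu"]) <= max_edu_diff):
                pairs.append((p["id"], n["id"]))
                unused.remove(n)  # each normal serves as a match only once
                break
    return pairs

# Hypothetical records (age in years, education in years of schooling).
patients = [{"id": "P1", "age": 34, "edu": 10},
            {"id": "P2", "age": 50, "edu": 8}]
normals = [{"id": "N1", "age": 51, "edu": 8},
           {"id": "N2", "age": 35, "edu": 11},
           {"id": "N3", "age": 20, "edu": 12}]
pairs = match_pairs(patients, normals)
print(pairs)
```

A greedy pass of this kind can leave a patient unmatched even when a full matching exists, so with a small normative pool a more careful assignment would be needed.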


Validity Coefficients. An index of screening efficiency, according to a method devised 
by Eysenck®: *? was computed for the total scores on the shortened tests. It con- 
sists in subtracting the proportion of normals incorrectly identified from the pro- 
portion of patients correctly identified. The indices are as follows: Chicago men .64, 
Chicago women .70; Boston men .70, Boston women .71. 
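The index just described is a simple difference of two proportions, sketched below. The cutting score, the sample scores, and the assumption that low communality scores mark the patient group are illustrative only and are not taken from the study's data.

```python
def screening_index(patient_scores, normal_scores, cut):
    """Proportion of patients correctly identified minus proportion of normals misidentified."""
    # Assumption: a total score below the cutting point is called "patient".
    hits = sum(s < cut for s in patient_scores) / len(patient_scores)
    false_alarms = sum(s < cut for s in normal_scores) / len(normal_scores)
    return hits - false_alarms

# Hypothetical total scores on the shortened test.
patient_scores = [10, 12, 15, 22, 9]
normal_scores = [25, 18, 30, 27, 21]
index = screening_index(patient_scores, normal_scores, cut=20)
print(round(index, 2))
```

The index is 1.0 when the cutting point separates the two groups perfectly and 0 when it identifies patients no better than it misidentifies normals.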

Another method of estimating validity was used. Phi coefficients for each of the 
25 items were computed and converted into product-moment coefficients, on the 
assumption of continuity between normals and patients on communality of thought. 
The median coefficients were as follows: Chicago men .45, Chicago women .66; 
Boston men .65, Boston women .68. The shortened test has concurrent validity, 
since it distinguishes the patients from the normative groups. 
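For reference, the phi coefficient for a single item can be computed from a fourfold table as sketched here. The dichotomy into "common" and "uncommon" responses and the cell counts are hypothetical, and the conversion of phi to a product-moment coefficient used in the study is not reproduced, since the text does not state the formula.

```python
import math

def phi_coefficient(a, b, c, d):
    """Phi for a 2 x 2 table.

    a = normals giving a common response, b = normals giving an uncommon one,
    c = patients giving a common response, d = patients giving an uncommon one.
    """
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    return (a * d - b * c) / denom

# Hypothetical counts for one item, 50 normals and 50 patients.
phi = phi_coefficient(a=40, b=10, c=15, d=35)
print(round(phi, 3))
```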

As a final check on validity, the ratings of psychologists on 50 Boston female 
patients were secured from the hospital. Ratings ranged from 0 to 80 and were nearly
normally distributed. The product-moment coefficient of correlation between the
ratings and the scores on the shortened test was .48, which is significant beyond the
.01 level of confidence. The shortened test seems suitable for differentiating between
various degrees of mental illness. 


Reliability Studies. Reliability was estimated in two ways. The internal consistency 
was measured by correlating the odd-numbered items with the even-numbered items 
for samples of 400 men and 400 women in groups of 100 each. The split-half reliabil- 
ity when corrected by the Spearman-Brown formula ranged from .88 to .94 with a 
median at .92. 
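The odd-even procedure with the Spearman-Brown correction can be sketched as follows. The item scores are invented, and the helper names are not from the paper; the correction itself is the standard formula for doubling test length, r' = 2r / (1 + r).

```python
import math

def pearson(x, y):
    """Product-moment correlation between two equal-length lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def spearman_brown(r_half):
    """Estimate full-length reliability from the correlation between two halves."""
    return 2 * r_half / (1 + r_half)

def split_half_reliability(item_scores):
    """Correlate odd-numbered with even-numbered item totals, then correct."""
    odd = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    return spearman_brown(pearson(odd, even))

# Hypothetical item scores: one row per subject, one column per item.
rows = [[1, 1, 1, 1, 1, 1],
        [1, 1, 0, 1, 1, 0],
        [0, 1, 0, 0, 1, 0],
        [0, 0, 0, 0, 0, 1],
        [1, 0, 1, 1, 0, 1]]
reliability = split_half_reliability(rows)
print(round(reliability, 3))
```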

The test-retest reliability was also considered. This was obtained by administer- 
ing the test twice to the same persons, at intervals of time from 4 to 8 weeks. The 
correlation for eighty items was .49, and this value did not change when the odd items 
of the test were correlated with the even items of the retest. When the total length 
of the test was reduced by scoring only the items which had top screening efficiency, 
the correlation between test and retest rose to .55 for a sample of 82 male and female 
subjects. Other groups are being tested in order to confirm this reliability figure. 
Guilford tells us that both reliability and validity cannot be maximum in a test 
which has items with a variety of difficulty and there must be a compromise between 
the two. The values for both presented in this study fall well within the suggested 
ranges for tests of this type. 

The factors of age and education might be thought to influence the performance 
on the LLS. In order to offset the influence of these variables, matched groups were 
used for the item analysis of the test protocols. The four total stratified samples of 
400 each were used and a coefficient of correlation computed between age and the 
Language score. The results are as follows: Chicago men, -.22, Chicago women,
-.23; for 400 Boston men, -.29, and Boston women, -.31. The relationship between
age and the scores was not rectilinear, and the coefficients varied from one subgroup
to another. As an example, taking the group which had three years of high school,
age correlated with the LLS -.31 for the Chicago men; but taking the group which
had two years of college, the correlation was -.12. The coefficients are slightly lower-
ed by using only the top 25 items in scoring. With these reduced scores, tables are
being prepared which will enable investigators to make the small corrections for
age variance.

262 VINCENT V. HERR

On the other hand, the LLS correlated positively with years of education and 
the coefficients are as follows: Chicago men .21, Chicago women .22; Boston men 
.17, Boston women .18, and all these coefficients are significant at the .01 level of 
confidence. However, the general trend observable among the normals for education 
to be related to better LLS scores is not found among the patients. For some un- 
known reason they neither show consistent losses due to age nor consistent gains 
due to education. The present samples were too small to give significance data. 

The intelligence factor was also considered in the present investigation. No 
intelligence scores were available for the large stratified samples, but the positive 
correlation already mentioned between education and the LLS score suggests a re- 
lationship between intelligence and the LLS. In order to eliminate the possible effect 
of varying amounts of education on the LLS, two homogeneous groups were selected 
for study. The one consisted of 100 senior high-school boys who had been tested with 
the Henmon-Nelson test, Form C; the other of 100 male college freshmen and 100 
female college freshmen who had been tested with the American Council on Educa- 
tion entrance examination. For the high-school senior boys, the correlation with the 
LLS was .11, significant at the .30 level, whereas for the college males it was .06 and 
for college females, .04. The correlation between the “L” score on the ACE and the
LLS score was .06 for the males and for the females. Apparently intelligence apart 
from education is a negligible factor in communality of thought. 


SUMMARY AND CONCLUSIONS 


The Loyola Language Study distinguishes between schizophrenics and normals, 
using any of three systems of scoring, namely standard scores, doubled-root fre- 
quency scores, and median scores. The screening efficiency of the test is increased by 
using only high validity items scored by standard scores. Such scores also permit of
parametric statistical operations. There is a tendency for older persons to make
poorer scores, and for this effect to be counteracted by education. Intelligence, 
apart from age and education, is not a significant factor. 

PROBLEM

Many writers have called attention to the problem of language disturbance 
associated with psychiatric disorders“: *: * 1. 12, 18.16). Goldstein and others“: *: ¢ 7 
4) have shown that impairment of the abstracting function is found in both schizo- 
phrenia and organic involvement. Psychiatrists have long associated proverbs with 
the measurement of verbal abstraction, leaning heavily upon Benjamin’s®> list. 

In 1956, Gorham“: ® reported a Proverbs Test which offered three parallel 
forms of a clinical test with an objective method of scoring including norms for 
normals and schizophrenics. A multiple-choice form of his test made group adminis- 
tration and clerical scoring possible. Gorham“®) demonstrated that either form of 
the test would differentiate a normal and chronic schizophrenic population of male 
veterans at a high level of statistical significance. This finding raises two questions: 
Are the results equally applicable to male and female populations? Would this ap- 
parent indication of impairment in abstract thinking be reflected by a greater loss if 
the test were applied to organic cases? It is the purpose of this study to use the 
Proverbs Test to test the hypothesis that there is a drop in level of the abstracting 
function from normal to chronic schizophrenic to organic subjects. Both sexes will 
be investigated. 


A DESCRIPTION OF THE PROVERBS TEST 


The clinical Proverbs Test consists of three parallel forms containing 12 pro- 
verbs each, scaled in order of difficulty. Samples are: 
Where there’s a will there’s a way. 
Barking dogs seldom bite. 
A stream cannot rise higher than its source. 
Let sleeping dogs lie. 
The wife is the key to the house. 
Great bodies move slowly. 


An abstract (Ab) score on this form of the test is secured by rating answers accord- 
ing to sample scores provided for each proverb. Each answer is rated 0, 1 or 2 points; 
the total possible score being 24. 

The multiple-choice Proverbs Test consists of forty items with four answers each. 
It is scored by stencil. An abstract (Ab) score consists of the total number of correct
responses. A concrete (Co) score consists of the number of items marked from 18
“concrete” answers. Samples of an easy and a difficult item are:


1. The sun shines on all alike.
   a. It’s the same sun everywhere.
   b. All are created equal. (Ab answer)
   c. The sun shines on everybody. (Co answer)
   d. People that do the same things are alike.

Crooked logs make a straight fire.
   a. Even the humblest can create things of beauty.
   b. Handicaps may be used to advantage. (Ab answer)
   c. There is some good in everybody.
   d. Things will come out all right in the end.
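Stencil scoring of the multiple-choice form amounts to comparing each marked answer with two keys, one for abstract and one for concrete answers. The sketch below uses only the two sample items; the item labels and the function name are hypothetical, and the second item is given no concrete key because the sample shows none.

```python
def score_proverbs(answers, abstract_key, concrete_key):
    """Return (Ab, Co): counts of marked answers matching each key."""
    ab = sum(1 for item, choice in answers.items()
             if abstract_key.get(item) == choice)
    co = sum(1 for item, choice in answers.items()
             if concrete_key.get(item) == choice)
    return ab, co

# Keys for the two sample items only; labels "sun" and "logs" are illustrative.
abstract_key = {"sun": "b", "logs": "b"}
concrete_key = {"sun": "c"}  # not every item carries a keyed concrete answer

ab, co = score_proverbs({"sun": "c", "logs": "b"}, abstract_key, concrete_key)
print(ab, co)
```

On the full test the abstract key would cover all forty items and the concrete key the 18 items that have a keyed concrete answer.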


The clinical manual® for the test provides norms for schizophrenic and normal 
populations including mean scores by intelligence levels. 


¹This research was supported by a grant from the Hogg Foundation for Mental Hygiene, Austin,
Texas. The study was submitted by Mr. Elmore in partial fulfillment of the requirements of the 
M. A. degree at Baylor University. 

DESIGN


The essential design of this study consists of matching cases for intelligence in 
order to compare Proverbs Test scores by sex and diagnostic group. The index of 
intelligence used was the Word Knowledge Test of the Airman Classification Bat-
tery.² Individual male and female subjects of chronic schizophrenic and organic
diagnoses were matched within one stanine score on the vocabulary test with normal 
subjects. The patient population was secured from Texas state hospitals. Those 
selected had been hospitalized for more than five years and had not received active 
treatment (electro-convulsive, insulin coma, or tranquilizing drugs) within the prior 
year. The organic group had been diagnosed as “organic without functional psy-
chosis” and otherwise met the same specifications as the schizophrenic group. The
control population was drawn from a large normal population of industrial and office
workers, laborers, etc.


RESULTS 


Three measures of abstraction were secured from the two forms of the Proverbs
Test: an abstract score and a concrete score from the multiple-choice test and an
abstract score from the clinical test. The mean scores on each of these measures are
shown in Table 1. This table confirms the hypothesis of this study: that the ab-
stracting function drops from normal to schizophrenic to organic subjects. It will
be noted that the concrete scores increase as the abstract scores decrease. It is
assumed by the authors that these two measures are representative of two aspects
of the same intellectual process, viz., verbal abstraction. “Concrete” thinking is con-
sidered to be the opposite of, or the inability to do, abstract thinking. This assump-
tion is substantiated by the negative correlation (-.58 for 100 normal subjects and
-.54 for 173 chronic schizophrenic subjects) found by the author in a previous
study“. Further statistical analysis of the data indicates which scores differentiate
various diagnostic groups with statistical significance.

TABLE 1. MEAN SCORES ON THE PROVERBS TEST (N = 25 FOR ALL GROUPS)

Population Group    Multiple-choice test scores          Clinical test, Form I
                    Abstract score    Concrete score     Abstract score

Normal
  Males 22. : 11.4
  Females . ; 9.4
Schizophrenic
  Males , 1.6
  5
  Females 4 ; 2:!
Organic
  Males 3 8 1.2
  Females 2.5 5 1.9


Multiple-Choice Abstract Score. The variation in the abstract scores on the multiple- 
choice test was analyzed by sex, vocabulary and diagnostic grouping. The findings 
are presented in Table 2 and show no significant sex differences; that is, patients or
normal subjects of either sex make approximately the same scores on the Proverbs 
Test whether divided by vocabulary level or diagnostic category. It can also be in- 
ferred that the diagnostic categories are independent of intelligence as measured by 
vocabulary level. The close relationship of vocabulary level and the Proverbs ab- 
stract score confirms Gorham’s‘*? earlier findings based solely on male military per- 
sonnel. The important finding in terms of this investigation is the highly significant 
variation of Proverbs abstract scores between diagnostic categories which are in-
dependent of sex and vocabulary level. This finding directly supports the hypothesis
that both schizophrenia and organic involvement result in impairment in the abstracting
function.

²Used by permission of the AF Personnel and Training Research Center, Lackland Air Force Base,

MEASURING THE IMPAIRMENT OF THE ABSTRACTING FUNCTION

TABLE 2. ANALYSIS OF VARIANCE FOR ABSTRACT SCORE ON THE MULTIPLE-CHOICE PROVERBS TEST

Source of Variation          df
Sex                           1
Diagnostic category           2
Vocabulary level             24
Sex x Diagnosis               2
Sex x Level                  24
Diagnosis x Level            48
Sex x Diagnosis x Level      48
Total                       149

*Significant at the .001 level of confidence.
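As a check on the degrees of freedom in Table 2, the partition for a design with 2 sexes, 3 diagnostic categories, and 25 matched vocabulary levels (one subject per cell, 150 subjects in all) can be worked out directly. The sketch assumes that this is the layout behind the table; the factor labels are shortened for convenience.

```python
from itertools import combinations
from math import prod

def anova_df(levels):
    """Degrees of freedom for every main effect and interaction in a full factorial."""
    names = list(levels)
    df = {}
    for k in range(1, len(names) + 1):
        for combo in combinations(names, k):
            # df of an effect is the product of (levels - 1) over its factors.
            df[" x ".join(combo)] = prod(levels[f] - 1 for f in combo)
    return df

levels = {"Sex": 2, "Diagnosis": 3, "Level": 25}
df = anova_df(levels)
total = prod(levels.values()) - 1  # one observation per cell

for source, value in df.items():
    print(f"{source:25s} {value:3d}")
print(f"{'Total':25s} {total:3d}")
```

On this layout the Sex x Diagnosis interaction carries 2 degrees of freedom, and the seven components sum to the total of 149.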

To determine where the variation existed between the three diagnostic categor-
ies, the t test was applied with values as follows: normals and schizophrenics, 7.21;
normals and organics, 6.53; schizophrenics and organics, .47. The first two t values
are significant at the .001 level of confidence while the third one is obviously not
significant. These data indicate that the abstract score of the multiple-choice test
tends to show a striking impairment of abstraction between normals and both psy-
chiatric groups. It does not supply any evidence of a greater loss for the organic
group.


Multiple-Choice Concrete Scores. The application of Friedman’s chi-square test to the 
concrete scores yielded a chi-square value of 14.23 which is significant at the .001 
level of confidence. The t test was considered to determine which groups could be
separated by the concrete score. Because of the large number of zeros this test could
not be applied to the normal group. Fortunately, however, the t test could be used
in comparing the scores of the schizophrenic and organic groups. The value of t was
5.2 which is significant at the .001 level of confidence. The concrete score, therefore,
supplies the needed index for differentiating these two groups which could not be 
distinguished by the abstract score. 
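Friedman's chi-square, used above for the concrete scores, ranks the scores within each matched set and compares the rank sums of the groups. The sketch below assumes that each matched set contributes one normal, one schizophrenic, and one organic score; the data are invented, and no correction for ties is applied beyond assigning average ranks.

```python
def average_ranks(values):
    """Ranks from 1 upward, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for t in range(i, j + 1):
            ranks[order[t]] = avg
        i = j + 1
    return ranks

def friedman_chi_square(blocks):
    """Friedman statistic for n matched sets (rows) and k conditions (columns)."""
    n, k = len(blocks), len(blocks[0])
    rank_sums = [0.0] * k
    for row in blocks:
        for j, r in enumerate(average_ranks(row)):
            rank_sums[j] += r
    return 12.0 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3.0 * n * (k + 1)

# Hypothetical concrete scores: columns are normal, schizophrenic, organic.
blocks = [[0, 5, 9],
          [1, 4, 8],
          [0, 6, 7],
          [2, 3, 10]]
chi_sq = friedman_chi_square(blocks)
print(chi_sq)
```

The statistic is referred to the chi-square distribution with k - 1 degrees of freedom, here 2.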


The Clinical Test: Form I. Analysis of the scores of the clinical test was made difficult 
by the high incidence of zero scores among both schizophrenic and organic patients. 
The test scores were ranked and the Friedman chi-square test was applied with a 
resulting chi-square value of 43.1, which is significant at the .001 level of confidence. 
An inspection of the mean scores of the clinical test in Table 1 is sufficient to reveal 
that the differentiation in mean scores is between normals and organics or schizo- 
phrenics. This test might have some value, then, in distinguishing normals from 
chronic psychiatric patients. It throws little or no light on the hypothesis being 
investigated in this study. 


DISCUSSION


It was the assumption of this study that the clinically discernable impairment 
of the abstracting function found in schizophrenic and organic patients could be 
demonstrated quantitatively by a test of verbal abstraction. The results of the 
study have justified this assumption. The experimental design provided a rigid test 
of the hypothesis, since the factors of sex and intelligence were controlled in order to 
isolate the variation which was due solely to diagnostic grouping. 

The two scores, abstract and concrete, of the multiple-choice test are most use- 
ful in differentiating between the three diagnostic groups. Either of these scores will 
differentiate at a high level of confidence between the normal and either of the two 
psychiatric groups. The concrete score makes the added contribution of separating 
the schizophrenic and organic groups at a high level of confidence. Since this form 
of the test is suitable for group administration and scoring by stencil, the potential 
usefulness for research of this measurement of the abstracting function is suggested. 





266 CLYDE M. ELMORE AND DONALD R. GORHAM 


A point of theoretical interest is the negative relationship between the abstract 
and concrete scores of the Proverbs Test. This fact strongly suggests that the ab- 
stracting function and concreteness of thinking may actually be opposite portions of 
an abstract-concrete continuum, rather than two dichotomous thinking qualities. 
The findings of this study point to this type of modification of intellectualization in 
the process of impairment which occurs in certain cases of schizophrenics and organ- 
ics. The abstracting attitude of normals apparently gives way to a greatly diminish- 
ed abstraction with associated concreteness in chronic schizophrenics compared with 
almost complete substitution of concreteness for abstraction in organics. 

The clinical form of the test does not lend itself as well for this type of study, 
partly because a concrete score is not readily derived and partly because the material 
elicited by this test is most significant when considered qualitatively as well as 
quantitatively. 


CONCLUSIONS 


1. A rigid experimental design was used to test the hypothesis that clinically 
discernable impairment of the abstracting function could be measured quantitatively. 
Normal, schizophrenic and organic subjects were matched for intelligence and com- 
pared by sex and diagnostic groupings. 

2. The abstract score of the multiple-choice Proverbs Test differentiated be-
tween normals and either psychiatric group at the .001 level of confidence. It failed
to differentiate organics from schizophrenics.


3. The concrete score of the multiple-choice Proverbs Test differentiates be-
tween normals and psychiatric patients and in addition distinguishes between chronic
schizophrenic and organic groups at the .001 level of confidence.


4. The findings of the study confirm the hypothesis that a drop in level of the 
abstracting function from normal to chronic schizophrenic to organic groups can be 
demonstrated quantitatively. 


EVALUATION OF INTELLECTUAL POTENTIAL
IN HEMIPLEGIC INDIVIDUALS 


FRANKLIN C. SHONTZ¹
Highland View Hospital, Cleveland, Ohio 


PROBLEM 


Practically none of the currently popular psychological tests provides norms for 
the evaluation of hemiplegic individuals; and, if they did, such norms would prob- 
ably be practically useless since almost all available tests of intellectual capacity 
utilize either verbal communication or performances requiring manual dexterity on 
the part of the subject. The construction of specialized norms, therefore, would 
often imply a contamination of the underlying factor being measured with the 
physical effects of the disability itself. For the psychologist this currently means it 
is practically impossible to make statements about a physically disabled person which 
clearly differentiate between the person’s ability to express his capacities and the 
underlying extent of the capacities themselves as potential psychological resources. 
Further, it is currently impossible to make statements about the disabled individual 
in terms of his own group. Both are serious problems in a rehabilitation setting, be- 
cause actual rehabilitation potential probably depends far more upon internal 
capacities than upon the physical ability of the patient to express himself intelli- 
gently on typical psychological tests. Success in a rehabilitation program, designed 
by experience to meet the needs of the physically disabled, probably does not re- 
quire what would usually be called a ‘“‘normal’’ level of abilities. How well a hemi- 
plegic performs in rehabilitation, for example, may be much more closely determined 
by his abilities relative to other hemiplegics, than by his abilities relative to the 
general population. 

The hemiplegic (especially the right hemiplegic) is in a particularly difficult
situation. Organic brain damage affects both his intellectual functioning and his
physical means of expression. The right hemiplegic has usually lost the functioning
of his dominant body side and may, in addition, have particularly disturbing prob- 
lems with verbal expression. Usually on superficial examination, right hemiplegics 
appear to be more seriously disturbed than do left hemiplegics. Therefore, it is of 
especial importance that our evaluation of these people be in terms of norms based 
on their own group and of tests which minimize their physical, though not their 
intellectual, loss. 


METHOD 


Procedure. Two tests usually used for children were administered to a sample of 50 
mixed hemiplegic patients. These tests require only gross arm movements as res- 
ponses, thereby minimizing the effect of the hemiplegic’s loss of dexterity and poss- 
ible verbal-expressive difficulties. One test, the Knox Cubes from the Arthur Point 
Scale of Performance tests“), requires the subject to repeat tapping patterns demon- 
strated to him by the examiner on a set of 4 stationary blocks. No speech is used and 
the tapping itself requires only gross arm movements. Primarily, this test is thought 
to measure immediate memory, or recall. The other test, the Columbia Mental 
Maturity Scale®), was designed for use with handicapped children. It requires the 
subject to pick from successive series of pictures the one out of three, four, or five 
stimuli which “does not belong” with the others. The task, therefore, seems to in- 
volve abstraction and discrimination, although it is often presented as a measure of 
intelligence. 

The Knox Cubes series were administered twice, once preceding, and once 
following the Columbia Mental Maturity Scale. Both tests were scored in the man- 
ner prescribed by their originators. 


¹Thanks are due to Marilyn Weston and Robert Postel for their assistance in collecting data for
this research. Additional data available from the author on request. 

Subjects. The sample was composed of 50 institutionalized hemiplegic patients,
representing almost all the testable and clearly hemiplegic patients in a 450-bed hos-
pital for the chronically ill. Of the 50 subjects, 17 were right hemiplegics (10 males,
7 females), and 33 were left hemiplegics (17 males, 16 females). The mean age was 
63.0 years with a standard deviation of 9.47. There was no statistical relationship 
between the side affected by the hemiplegia and either age or sex; nor was there any 
relationship between age and sex in the sample. The group was somewhat heavily 
loaded with left hemiplegics; but, whether this is representative of the general popu- 
lation of hemiplegics is not known. 

The Knox Cubes and Columbia Mental Maturity Scale were administered to 
all 50 subjects. In addition, data were available for 31 of these subjects on four sub- 
tests of the Wechsler Memory Scale: Mental Control, Logical Memory, Digit Span, 
and Associate Learning. Five of these 31 subjects were so verbally disturbed, how- 
ever, that they could not score over 2 points on all four Wechsler subtests. Therefore 
it was necessary to determine all relevant results both with and without these five 
subjects. In general, their inclusion exaggerated all other findings slightly, probably 
by virtue of their extreme position with respect to the rest of the distributions. There 
was no evidence that the sub-group taking the Wechsler Memory Scale was in any 
other way different from the sample as a whole. 


RESULTS 


The Columbia Mental Maturity Scale and the Knox Cubes tests were found to 
correlate about .60 with each other, the correlation varying from .58 to .61, depend- 
ing upon the group or sub-group involved. This correlation is significant at a proba- 
bility level of .01. 

The Columbia Mental Maturity Scale and Knox Cubes correlated .47 and .42, 
respectively with the subtests of the Wechsler Memory Scale. Both correlations are 
significant at the .05 level, and both correlations dropped slightly below the level of 
statistical significance when the 5 extremely disturbed cases were removed from the 
sample tested. No significant differences were found between right and left hemi- 
plegics on the Columbia Mental Maturity Scale or the Knox Cubes. 

Right hemiplegics were found to obtain significantly lower scores (p < .03) than 
left hemiplegics on the combined subscales of the Wechsler Memory Scale. Remov- 
ing the Digit Span subtest, however, reduced biserial correlations to chance levels and 
eliminated the obtained difference between groups. No significant relationship was 
found between age and performance on any of the three research measurements. 

Frequency distributions for the Columbia Mental Maturity Scale and the 
Knox Cubes tests were substantially normal, although the distribution for the Knox 
Cubes test skewed slightly to the left. In approximate mental age terms, both groups 
averaged about seven years, and the Knox Cubes gave a somewhat greater spread of 
scores than did the Columbia. These results fall about four and one-half years below 
what an extrapolation of Wechsler’s approximations“: »- 226) might lead one to ex- 
pect for this age-group. Statistics describing these frequency distributions are pre- 
sented in greater detail in tables 1 and 2. 


TABLE 1. PERCENTILE DISTRIBUTION OF KNOX CUBES AND COLUMBIA
MENTAL MATURITY SCALE SCORES

Percentile    Raw Score (Knox Cubes)    Raw Score (Columbia Mental Maturity Scale)
              11.25                     69.5
              10.25                     63.5
               8.25                     55.5
               5.50                     49.0
               3.25                     44.0

TABLE 2. DISTRIBUTION OF Z-SCORES FOR KNOX CUBES AND COLUMBIA
MENTAL MATURITY SCALE

              Raw Score (Knox Cubes)    Raw Score (Columbia Mental Maturity Scale)
              13.25                     77.0
              10.50                     66.











CONCLUSIONS 


The Columbia Mental Maturity Scale and the Knox Cubes tests are appropriate
instruments for the evaluation of the intellectual capacities of hemiplegic patients. 
They do not require the use of expressive language, nor do they make demands for a 
high level of manual skill. They do not seem to be affected by the side on which the 
hemiplegia occurs, and the materials were not found to be insulting to the adult in- 
tellect. None of the subjects in this sample were found to be untestable on these 
examinations, and both tests are easy and quick to administer. The tests are highly 
enough correlated to warrant the assumption that both measure the same thing, yet 
the correlation is not sufficiently high to allow for prediction in the individual case. 

On the other hand, the Wechsler Memory Scale does seem to penalize certain 
subjects with verbal-expressive problems; and it tends to be more difficult for right 
than for left hemiplegics, especially the Digit Span sub-test. A few subjects were 
found to be almost completely untestable with the Wechsler subtests, yet these same 
subjects were found to demonstrate measurable intellectual and memory ability on 
the other two examinations. 
THE PREDICTION OF ADJUSTMENT OVER A FOUR
YEAR INTERVAL*


HARRY BEILIN 
Institute of Child Welfare, University of Minnesota 


INTRODUCTION 


This paper reports one aspect of the Nobles County (Minn.) Mental Health
Survey, which has been underway since 1949. The general investigation had three
main aims: (1) to undertake a study of the prediction of adjustment over time, (2)
to secure age data on personality and adjustment measures, and (3) to develop screening
instruments for locating children and adolescents in need of special attention. The
first of these aims, the prediction of adjustment, will be reported on here. The pre-
dictions cover a four-year period, and the results to be reported are for the men in
the sample. The problems of the present prediction study can be reduced to (1)
the development of predictors, (2) the development of criteria, and (3) the relation of
predictors to criteria.


METHODS 


Development of Predictors. In studies of the prediction of mental health, difficulties
arise from the lack of clarity in, and the diversity among, conceptions of its nature and from
the variety of approaches to its measurement. Variables effective as predictors may
be quite different in character from the phenomena being predicted. This is particu-
larly true if the predictor variables are not operative over the period from prediction
to validation, and if behavioral discontinuities cause what is
evaluated as desirable or undesirable at one period of time (e.g., childhood or ado-
lescence) to be judged the opposite in another period (e.g., adulthood). As an illus-
tration, dependence upon parents is a desirable feature of childhood but undesirable
if characteristic of adulthood.

With knowledge of “age-related” and “age-free” behavior quite limited, the
choice of relevant predictor variables is facilitated in one of two ways or in a com- 
promise of these: in the framework of a theory or by strictly empirical means. In 
the present study, the predictor instruments were designed to reflect, in general, 
the “personal” and “social” adjustment of the subject at the time of testing. The
behavior areas included were based on previously established relationships or 
assumptions having a fair measure of face validity. The instruments can be classed 
in two groups: those to which the subject responds and those employed by his teach- 
ers to evaluate his behavior. The 1950 predictor instruments completed by the 
pupils were of the inventory, sentence completion, rating scale and nominating form 
types. They include measures of the child’s relations to his family, social responsi- 
bility, personal habits and skills about the home, play and recreational activities and 
interests, and a scale of psychoneurotic items. 

The instruments employed by teachers included a nomination form (of the best 
and poorest adjusted), a Thurstone type rating scale of responsible and irresponsible 
behaviors, and a personality rating scale. 

In 1950 the psychological instruments developed or adapted by the Institute 
staff were administered to 3200 children and adolescents between the ages of 9 and 
18 in all the schools (public and parochial) of a Minnesota county. Ratings by 
teachers were obtained from the administration of the aforementioned instruments. 


*This study has been supported in part by research grant M-690 from the National Institute of
Mental Health, U. S. Public Health Service, and in part by funds from the Institute of Child Welfare,
University of Minnesota. The principal investigators are J. E. Anderson and D. B. Harris. The
author’s tenure with the study was from 1

A major portion of this report was presented in the symposium “The Prediction of Adjustment”
at the 1956 meetings of the Midwestern Psychological Association in St. Louis.
Concurrent validation of the 1950 measures was accomplished by using as
criteria the Juvenile Court judge’s nominations of delinquents and teachers’ des-
criptions of these youngsters. The instruments as a group were sufficiently dis-
criminating, employing these criteria, to suggest the value of predictive validation.
For prediction two indices were developed. The Pupil Index is a composite score
derived from the group of instruments to which the youngster responds.¹ The
Teacher Index is a composite score resulting from four teachers’ rating instruments.
In the prediction study, high standard scores (7.0 - 9.9) have been used to predict 
good adjustment, low scores (0.1 - 3.0) to predict poor adjustment, and middle scores 
(4.0 - 6.0) to predict adequate, but neither good nor poor adjustment. In this study 
poor adjustment is more broadly conceived than as involving neurotic or psychotic 
behavior. It includes, for example, behavior which might involve the individual 
with the law or with therapeutic or custodial agencies. 
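
The cut-offs stated above can be written as a small lookup. The following is a minimal sketch in Python and is not part of the original study: the name `predicted_group` is hypothetical, and because the text does not say how scores falling between the stated bands (for example 3.5 or 6.5) were treated, the sketch returns `None` for them rather than guessing.

```python
from typing import Optional

# Standard-score bands exactly as stated in the text (inclusive at both ends).
BANDS = {
    "low": (0.1, 3.0),     # used to predict poor adjustment
    "middle": (4.0, 6.0),  # used to predict adequate adjustment
    "high": (7.0, 9.9),    # used to predict good adjustment
}


def predicted_group(standard_score: float) -> Optional[str]:
    """Return 'low', 'middle' or 'high', or None when no stated band applies."""
    for group, (lower, upper) in BANDS.items():
        if lower <= standard_score <= upper:
            return group
    # Assumption: scores between the stated bands are left unclassified.
    return None


if __name__ == "__main__":
    # Illustrative scores only; these are not data from the study.
    for score in (2.4, 5.0, 8.7, 3.5):
        print(score, predicted_group(score))
```

Keeping the bands in one table makes it plain that only the three stated ranges are classified.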


Development of Criteria. In selecting the conceptual criteria for the present study “ 
we were interested in assessing the following: (1) absence of mental disease or be-
havior disorder, (2) normality of behavior (i.e., social adjustment, adjustment to
environment, etc.), (3) unity of personality (i.e., acceptance of self, ego strength,
value development, etc.), (4) adequate perception of reality (of self and external 
world), and (5) developmental resolution of conflict (the epigenetic principle of 
Erikson) “’. Our view has been that no one operational criterion would be adequate 
to reflect these essentially conceptual criteria and as a result a variety of criteria 
have been employed in the investigation as described below.


1. The interview criterion. The interview samples a number of areas germane to 
individual adjustment including: school adjustment, use of leisure, job adjustment, 
marital adjustment, social adjustment, family adjustment, and personal adjust- 
ments. The semistructured interview starts in each major area with a non-directive 
lead and “priming” questions follow to insure coverage of each area. During the 
interview the subject is rated as to his adjustment on a series of continua correspond- 
ing to the priming questions. Qualitative data relevant to each of the ratings are also 
recorded. In addition, at the termination of the interview the interviewer records a
score (from 0 to 9) which he feels best reflects the adjustment of the
person interviewed. This is denoted as the global estimate.

During the interviews two kinds of ratings were recorded. One indicated how 
the interviewer felt the subject was perceiving himself (e.g., with regard to social 
adjustment). In this case the interviewer estimated how the subject would rate 
himself had he the opportunity to do so. This is referred to as the interviewer’s 
estimate of the subject’s self rating. The sections containing these rating scales were
compiled in a single form and administered to the subject after the interview proper.
This form provides a score, the subject’s self rating. The object of this procedure is to assess
understanding and empathy by comparing the subject’s self rating with the inter-
viewer’s estimate of that rating, and thus to provide a reliability check. Another sec-
tion of ratings reflects the interviewer’s own reaction to the subject. These are more
in the form of summary judgments for each content area, and are denoted as the
interviewer’s rating.

The interview, then, contributes two criteria: the interviewer’s rating and the 
subject’s self rating. The global estimate, in substance, is similar to the interviewer’s 
rating and serves the same function although it involves no continuous rating. The
interviewer’s estimate of the subject’s self rating is included, as indicated, as a reliability
measure.


2. The inventory criterion. The adjustment inventory chosen as a criterion 
measure had to be capable of administration in a brief period prior to the interview. 
The morale scale of an instrument employed in studies by Rundquist and Sletto“ 


¹The indices resulted from averaging standard scores on each of the instruments and restandard-
izing the averaged scores.
was revised for this purpose. The resulting 20 item instrument had not been re-
validated prior to the present study, however.


3. The community reputation criterion. To obtain data concerning the subjects’ 
social adjustment, some 50 adults in Nobles County communities who had familiar- 
ity with the subjects were interviewed. They ranged from gas station attendants to 
community leaders. Each judge was given a list of subjects’ names and uniformly 
asked how he felt the subjects were “getting along.” Responses were recorded by
the interviewer and later content analyzed and scored. The scores were derived so
that each judge was his own criterion for good and poor adjustment. This was
accomplished by having the judge nominate the 3 best and 3 poorest among those
he had previously described, and indicate the bases for the nominations. These
provide the anchor points for scoring the individual reputations and serve in addi- 
tion as community criteria of mental health. 


RESULTS 


For the validation phase, four years subsequent to the original predictions, a 
sample of 76 males was interviewed and other data concerning them collected. The 
subjects are approximately equally distributed among the three (high, middle, low) 
predictor groups. Those interviewed were among those not in military service, moved 
from the county, or away at colleges too remote to be contacted @?. 

In relating the predictors to the criteria, three kinds of analyses were under- 
taken: (1) calculation of the rank order correlation between predictors and cri- 
teria, (2) calculations of the percent hits between those predicted as high, middle, and 
low, and those judged as high, middle, and low according to the criteria, and (3) 
computation of the intercorrelations among the criteria. The results of the inter- 
correlations are presented in Table 1. 
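
The first kind of analysis, a rank order correlation between a predictor and a criterion, can be sketched as follows. The paper does not name the coefficient used; this sketch assumes Spearman's rho, computed as a product-moment correlation on ranks with tied values given the mean of the ranks they occupy. The function names and the five pairs of scores are illustrative assumptions, not study data.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are zero-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_rho(x, y):
    """Rank order correlation: product-moment correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Invented predictor index scores and 0-9 criterion ratings for five subjects.
    predictor = [7.2, 5.1, 2.8, 8.9, 4.4]
    criterion = [6, 5, 4, 7, 2]
    print(round(spearman_rho(predictor, criterion), 2))
```

Working on ranks rather than raw scores means that only the ordering of subjects on each measure affects the coefficient.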


TABLE 1. CORRELATIONS (RANK ORDER) AMONG 1950 PREDICTORS AND 1954 CRITERIA OF
ADJUSTMENT

                             3        4          5        6        7        8          9
                           Comb.    Inter-     Global   Rund-    Repu-    Subject’s  Inter-
                           P & T    viewer’s   Esti-    quist-   tation   Self       viewer’s
                           Index    Rating     mate     Sletto   Score    Rating     Estimate
                                                        Score                        of Self-
                                                                                     Rating

PREDICTORS
1. Pupil Index             AT*                                            .02        .29*
2. Teacher Index                                                                     .13
3. †Combined Pupil
   Index and
   Teacher Index
CRITERIA
4. Interviewer’s                                                                     .70*
   Rating
5. Global                                                                            .69*
   Estimate
6. Rundquist-                                                                        .37*
   Sletto Score
7. Reputation
   Score
8. Subject’s                                                                         .61*
   Self Rating

†Pupil Index and Teacher Index weighted equally
*Significant at .01 level
**Significant at .05 level


The relationship of the 1950 predictors—the Pupil Index (PI), the Teacher 
Index (TI), and an arithmetic averaging of these (PI + TI) to the criteria just out- 
lined should provide some indication of the predictive value of these measures. The 
results indicate that the correlations with the criteria range as follows: for the Pupil
Index, from .02 (subject’s self rating) to .35 (interviewer’s rating); for the Teacher Index,
from -.18 (subject’s self rating) to .41 (community reputation score). The
combined indices range from -.06 to .35. For test validation purposes these correla-
tions, though for the most part positive, are relatively low. There is reason to believe, from other
evidence, that because sample slices were taken from our original population for the present
study, some correlations will be higher by about .20 for the sample than for the popu-
lation. This fact, however, will have little effect upon the conclusions drawn because
the correlations obtained from the investigation are in the main already
low.

From an analysis of hits in which the percentage of direct hits (i.e., predicted as
high, middle, or low and judged 4 years later as high, middle, or low according to the 
particular criterion) was determined, one would conclude that the predictive value 
of the instruments using the present criteria is essentially as already indicated. With 
the Pupil Index, Teacher Index and combination of these as predictors and the inter- 
view rating as criterion there are from 41% to 47% hits. Against the subject’s self 
rating the hits range from 29% to 33%; against the Rundquist-Sletto revision from 
24% to 25%, and against the community reputation scores from 19% to 21%. 
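
The percentage of direct hits described above can be sketched as a simple tally. This is an illustrative reading, not the author's procedure: a hit is counted when the predicted group and the group judged on the criterion carry the same label. The second function is an added convenience, not mentioned in the paper, giving the hit rate expected if prediction and criterion were unrelated, computed from the group sizes; with three predictor groups of equal size it comes to one third. The function names and the six cases are hypothetical.

```python
from collections import Counter


def percent_hits(predicted, judged):
    """Percentage of subjects whose predicted and judged groups agree."""
    if len(predicted) != len(judged) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    hits = sum(p == j for p, j in zip(predicted, judged))
    return 100.0 * hits / len(predicted)


def chance_percent_hits(predicted, judged):
    """Hit rate expected under independence, from the marginal group sizes."""
    n = len(predicted)
    predicted_counts = Counter(predicted)
    judged_counts = Counter(judged)
    agree = sum(predicted_counts[g] * judged_counts[g] for g in predicted_counts)
    return 100.0 * agree / (n * n)


if __name__ == "__main__":
    # Invented cases for illustration only.
    predicted = ["high", "high", "middle", "middle", "low", "low"]
    judged = ["high", "middle", "middle", "low", "low", "high"]
    print(percent_hits(predicted, judged))
    print(round(chance_percent_hits(predicted, judged), 1))
```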


DISCUSSION


We feel the explanation of these results should not be ascribed simply to the 
inadequacy of the instruments or the criteria, although this may be, at least in part, 
true. The possibility exists that the results are a consequence of changes in the atti- 
tudes and behaviors of the individuals about whom predictions were made. In the 
community reputation interviews, subjects have been described as very well adjusted
as adults, in contrast to what they had been in adolescence, often enough to make one skep-
tical of prediction measures which rest on a premise of consistency of behavior. Our
observation is that a variety of behaviors characterize adolescents and children
which for some will change and for others will not. To make more accurate predictions, then,
from childhood or adolescence to adulthood will require more understanding of the 
developmental aspects of adjustment; in particular those characteristics which re- 
main stable and those which change and under what circumstances. 

A very different interpretation, however, is also possible, i.e., that the “basic”
personality structure, if known, would provide a stable and more adequate guide to 
prediction. The instruments in this study have not done as well as they might, the 
argument could go, because they have not adequately assessed this basic structure. 
This may be, but only from further empirical research will this be known. To date, 
studies in which predictions are based upon expected consistency of behavior from 
the time of prediction to the time of validation have been lacking in spectacular 
success. 

An interesting relationship appears in the correlations of the predictive indices 
to the interview and community criteria. The Pupil Index (the composite of psycho-
logical instruments) predicts higher (.35) on the interviewer’s rating (the psycholo-
gist’s criterion) than on the community reputation criterion (.29). The Teacher
Index (which includes nominations of best and least well adjusted, and personality 
ratings) conversely predicts higher on the community reputation criterion (.41) than 
on the psychologist’s criterion (.18). In other words, for the men studied here, the 
psychologist’s predictor predicts better against a psychologist’s criterion and the 
non-psychologist’s predictor predicts better on a non-psychologist’s criterion. The 
combined indices predict equally on both criteria (.35, .32). 

Of the present criteria, the subject’s rating of his own adjustment (the subject’s 
self rating) is predicted with the lowest correlations by both the Pupil and Teacher 
Indices (PI .02; TI -.18). The community reputation and interviewer’s ratings are
about equally well predicted, with the highest correlations obtained (PI vs. inter- 
viewer’s rating: .35; TI vs. reputation score: .41). The Rundquist-Sletto instrument 
is somewhere between these (PI .15; TI .05). 
It might appear from the correlation of .91 between the interviewer’s rating and
the global rating that, the two being so highly correlated, one could be discarded, probably the
interviewer’s rating since it involves much more computation. There would be no hesita-
tion in suggesting this, were it not that the global rating was made only after the
interviewer had made the individual item ratings, though without knowledge of the
total rating score.

The correlation of .61 between the subject’s self rating and the interviewer’s 
estimate of the subject’s self rating can be regarded as reasonably high, considering the 
possible influences which might attenuate this relationship as, for example, the
tendency for a subject to perceive the meaning of some items differently from the
psychologist and the occasional inconsistency between written and verbal pro-
duction.


SUMMARY AND CONCLUSIONS 


For the purpose of predicting adjustment of children and adolescents two 
series of instruments were developed utilizing self-ratings and teacher evaluations. 
Three predictive indices were computed from these instruments and related to cri- 
terion measures of adjustment (interview, inventory, self-ratings and community 
reputations) four years subsequent to the original testing. The data reported are for 
a selected sample of male adolescents who in the validation phase were young adults. 

The correlations of predictive indices to validating criteria range from -.18
to .41, and in percentage “hits” from 19% to 47%. Considering the general lack of
data concerning base rates in prediction of mental health (rather than mental dis- 
order) it is difficult to adequately assess the success of the present predictions in 
light of the essentially “healthy” character of the present sample. The possible 
limitations of prediction studies based upon a premise of behavior consistency from 
the time of prediction to validation are discussed. 
FACTOR ANALYSIS OF RORSCHACH SCORES OF SCHIZOPHRENICS
BERNARD A. STOTSKY 
Boston University 


PROBLEM 

Factor analysis of Rorschach scores has attracted a number of investigations 
concerned with analysis of conventional scoring categories“: ¢: 5 ® 1. 12, 14, 15) though 
two have dealt with content “: ' and one with diagnostic signs“. Such studies have 
generally found a productivity factor. Wittenborn“* using college undergraduates, 
discovered three additional factors: one heavily weighted on W, K, and CF; a second 
on M, m, Fc, C’, FC, P, and O on which FC was more highly related to M than to CF 
and C; and the third on P. His original findings were supported in a second study on 
a mixed group of psychiatric patients“®) and by Williams and Lawrence“ on a 
different psychiatric population who also found a distinct intelligence and a shading 
factor. Combining the Rorschach and MMPI they again obtained a productivity 
factor and two factors, labelled ‘‘ego-strength’’ and “‘expressive-repressive”’, which 
showed overlap between the two tests“). Coan“, analyzing twelve determinant 
scores from Wittenborn’s original data, found seven factors none of which could be 
considered a productivity factor. He criticizes previous studies on two major 
grounds: (a) The productivity factor is an artifact resulting from the inclusion of 
total number of responses (R) as a scoring category. Since the correlations of R
with any location or determinant scores are really part-whole correlations, R may 
be expected to correlate spuriously with many of the scores. Omission of R does not 
eliminate productivity entirely since other variables like F and D may be heavily 
weighted with this factor. It does, however, remove a source of spurious correlation.¹
(b) Derivation of location and determinant scores from the same set of responses is 
another source of spurious correlation and increases the experimental interdepend- 
ence of certain categories. 
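
Criticism (a) can be illustrated numerically. In the sketch below, which uses invented response counts and hypothetical variable names, two scoring categories are constructed to be exactly uncorrelated with each other, yet either one correlates substantially with their sum, because the sum contains it. The total R in an actual protocol would include more categories than the two used here; the sketch shows only the direction of the effect.

```python
from math import sqrt


def pearson(x, y):
    """Product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Invented counts for four subjects on two scoring categories.
category_a = [1, 2, 3, 4]
category_b = [2, 0, 0, 2]  # built to be uncorrelated with category_a
total_r = [a + b for a, b in zip(category_a, category_b)]  # the "whole"

r_between_parts = pearson(category_a, category_b)
r_part_whole = pearson(category_a, total_r)

if __name__ == "__main__":
    print(round(r_between_parts, 2))  # the two parts are unrelated
    print(round(r_part_whole, 2))     # yet part and whole correlate
```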

A third criticism applicable to all but two studies“! ” is the questionable use of 
Pearson r in view of the short range of most Rorschach scores and the very high 
frequency of zero scores on most variables. Many variables have either curvilinear or 
point distributions. Tetrachoric r seems somewhat more appropriate for distribu- 
tions which can be dichotomized, but assumes underlying continuity, normality, and 
linearity of regression. Categories with more than half of their subjects giving zero 
responses cannot meet these criteria. Most of the remaining categories for which less 
than half of the subjects give zero responses also show considerable skewness but 
come closer to meeting the assumptions for correlational techniques. 

Consequently the following requirements were set up for the factor analysis of 
Rorschach data: (1) Elimination of R from the correlation matrix. (2) Treating loca- 
tion and determinant variables in separate matrices and omitting such content res- 
ponses as animal, human, popular, and original responses. (3) Including only those 
variables for which more than half of the subjects gave one or more responses. 


METHOD 

Subjects. Although schizophrenics have been frequently studied by means of the 
Rorschach, none of the factor investigations has been on this diagnostic group. The 
author used Rorschach data from a previous study of 148 diagnosed schizophrenics in 
two VA neuropsychiatric hospitals. These schizophrenics were divided into two 
groups of 74 each. One group remitted and was later discharged. The second group 
was continuously hospitalized for two years after admission. Testing had been per- 
formed soon after admission. 


Procedure. The protocols were scored according to Beck’s earlier procedures ®?. 
Reliability of scoring had been established in the previous study“. Using the med- 
ian as a cutting point, fourfold tables were set up for each pair of variables being 


¹Percentage scores are unusable because of their ipsative nature.
correlated: W, D, and Dd for location; F, M, FSH*?, FC, and CF for determinants.
Tetrachoric r’s were then calculated from computing diagrams“). The resulting cor- 
relation matrices were factored by the Thurstone centroid method. Factoring was 
continued until a communality smaller than .25 was obtained for a factor. The 
centroid factors were then rotated orthogonally using the criteria for simple struc-
ture. These operations were performed first for the entire sample, then for both sub-
samples. A further analysis was made for a content, a determinant, and a location
variable which the author felt shared high common variances, to serve as an illustra-
tion of the spurious correlation obtained when such variables, derived from a single
set of responses, are treated together in a correlation matrix.
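
The first two steps of the procedure, the median split into a fourfold table and the tetrachoric estimate, can be sketched as follows. Two points are assumptions and not the author's method: scores strictly above the median are counted as "high" (the text does not say how scores at the median were assigned), and the coefficient is obtained from the cosine-pi approximation, r = cos(pi / (1 + sqrt(ad / bc))), rather than from the computing diagrams the study used. The function names and example counts are hypothetical.

```python
from math import cos, pi, sqrt
from statistics import median


def fourfold_table(x, y):
    """Cross-classify paired scores as above / not above their medians.

    Returns (a, b, c, d): a = both high, b = x high only,
    c = y high only, d = neither high.
    """
    median_x, median_y = median(x), median(y)
    a = b = c = d = 0
    for xi, yi in zip(x, y):
        high_x, high_y = xi > median_x, yi > median_y  # assumption: ties count as low
        if high_x and high_y:
            a += 1
        elif high_x:
            b += 1
        elif high_y:
            c += 1
        else:
            d += 1
    return a, b, c, d


def tetrachoric_cos_pi(a, b, c, d):
    """Cosine-pi approximation to the tetrachoric correlation."""
    concordant, discordant = a * d, b * c
    if concordant == 0 and discordant == 0:
        raise ValueError("degenerate fourfold table")
    if discordant == 0:
        return 1.0
    return cos(pi / (1 + sqrt(concordant / discordant)))


if __name__ == "__main__":
    # Invented fourfold counts for 74 subjects, for illustration only.
    print(round(tetrachoric_cos_pi(30, 7, 7, 30), 2))
```

The approximation is at its best when both variables are cut near the median, which is the situation described above.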


RESULTS 


Location. Two factors were extracted. For all three groups Factor I’ is heavily
loaded with D and Dd. Factor II’ tends to load on W, with zero loadings on D and
Dd for the entire sample. Similar loadings are obtained for the subsamples, with the
one for W reaching significance in the remitted group. Factorially the three groups
are very similar.


TABLE 1. INTERCORRELATION OF RORSCHACH SCORES FOR SCHIZOPHRENICS¹


Total Sample Remitted Schizophrenics Hospitalized Schizophrenics 
D Dd F M FC FSH! D Dd F M FC FSH} D Dd F M FC FSH 


—20 -08 -30 -34 -22 -16 
63 82 

30 
43 41 
56 60 27 
07 14 «16 06 


62 69 62 46 























¹Because of the magnitude of the S. E. of tetrachoric r and the relatively small N, a lower limit of .40 was set
for significance of r’s and factor loadings, as was done in a previous study“).


TABLE 2. ROTATED FACTOR MATRICES FOR RORSCHACH LOCATION SCORES

          Total Sample        Remitted Schizophrenics    Hospitalized Schizophrenics
Scores    I’    II’    h²     I’     II’    h²           I’     II’    h²

W               37     15     -39    41     32           -23    34     17
D               08     46      84    01     71            72    00     52
Dd              01     66      77   -01     59            78   -04     61













Determinants. Two factors were extracted from the five determinants. For the 
total sample and for the hospitalized subsample one factor is most heavily loaded 
with F and M while the other is loaded for FC and FSH. For the remitted subsample 
one factor is loaded with F, M, FC, and FSH, the other with FC and CF. 


TABLE 3. ROTATED FACTOR MATRICES FOR RORSCHACH DETERMINANTS








Total Sample Remitted Schizophrenics | Hospitalized Schizophrenics 
III’ III’ 





F y 03 
M 5g 00 
FC ¢ y 62 
FSH ‘ -27 
CF y 69 














‘Shading was treated as a single category in accordance with the procedure of the original study 
(10 p. 123 ff.) 
TABLE 4. ROTATED FACTOR MATRICES FOR D, F, AND P








Total Sample
VI 


Remitted Schizophrenics | Hospitalized Schizophrenics 


Scores | 
v’ VI’ h? VW’: 4 3... mae h? 





| 

| 

| WW “Oo  @ Cs 
91 8 | -16 19 9% 9% 

| 


F 


| 
D 93 ¢ | ; 90 84 
} 96 
P P oy 56 


68 67 51 45 52 78 





D, F, and P. Two factors were extracted for the entire sample and remitted sub- 
sample. One, accounting for three-fourths of the variance, loads heavily with F and 
D and significantly for P. The other has a significant loading only for P. For the 
hospitalized subsample there are three factors, one loading significantly with F, D, 
and P; the other two with P. D and F are so similar in their loadings and so high in 
their intercorrelations as to raise the question whether they are not almost identical 
variables. 


DISCUSSION


With respect to location, Factor I’ is clearly a measure of the tendency to res-
pond to the details of the blots. D and Dd are closely related, the latter probably
representing an extension of the former for some subjects who in the course of giving 
a long protocol use rare details as well as common details. The tendency to use the 
whole blot in responding is quite distinct from an orientation to details, so that W 
has a negative loading on Factor I’ and a positive loading on II’ for which D and Dd 
show zero loadings. There is thus ample justification for treating whole responses 
separately from detail responses but not for treating common and rare detail res- 
ponses separately on the basis of present findings. 

For determinants the findings are not as clear. In contrast to previous findings 
FC is not as closely related to M factorially as it is to FSH in two groups and to CF 
in one. A review of factor loadings in previous studies shows that FC clusters with 
other variables besides M: in one“) even with CF. Thus the relationship of FC 
with M is a function of a factor which is loaded with other variables besides these 
two. In two groups the relationship between FC and FSH, hitherto obscured by the 
division of shading into many subcategories, appears to represent a controlled re- 
action to the color aspects of the blots. One is to the brighter, the other to the darker 
hues. In the remitted group the clustering of FC and CF responses may be due to 
the presence of greater control in the CF responses of this subsample so that they 
more closely resemble FC responses. There may be qualitative differences between 
groups for CF responses. In this connection Phillips“? found that schizophrenics 
who improve following shock treatment showed a higher color drive score than those 
who did not improve. His color drive score was derived from analysis of content 
without reference to determinants but a major portion of the color drive scores ap- 
peared to come from CF responses. 

F and M cluster together consistently and may represent an orientation to the 
form aspects of the blots, in contrast to FC, FSH, and CF which involve the use of 
color to elaborate responses. 

The results for F, D, and P point up the danger of treating separately compon- 
ents which arise from the same basic source. While P has independent loadings, D 
and F are very similar. This may be observed also in their high intercorrelations in 
other studies, never lower than .69. These two factors, and to a lesser extent P, seem 
to reflect a tendency to respond to the blots with simple form in a commonly ob- 
served detail. The D and F factors are present together in approximately forty to 
fifty percent of all Rorschach responses of all groups. The differences between 
groups and much of the variance attributable to distinctive traits will generally be 
found in the remaining responses. Treating R, D, and F as completely independent 
variables and then finding a high common variance may be a highly refined form of 
gilding the Rorschach lily. 
On one point the author is in complete agreement with previous studies, i.e.,
that Rorschach scoring categories need a thorough reworking. A scoring system 
should be developed to maximize the independence of the various categories and to 
produce scores which can be handled by conventional statistical procedures. If this 
means revision of both administration and scoring of the test, let us get to work on it. 


SUMMARY 


Factor analyses were performed for Rorschach location and determinant scores 
on a sample of 148 schizophrenics and on subgroups within the sample of 74 remitted 
and 74 hospitalized schizophrenics. Three changes from the procedures of previous 
studies were made. Total number of responses (R) was omitted from the correlation
matrices. Location and determinants were treated separately. Only variables for 
which more than half of the subjects gave one or more responses were included in 
this study. 

For location two factors were found. Common and rare detail scores clustered 
together on one, while whole responses tended to load on the other factor. For de- 
terminants two factors were found. On one, pure form and human movement res- 
ponses clustered together. On the other, shading and color responses showed high 
loadings though not as consistently as F and M. Separate factor analyses were per- 
formed for F, D, and P — determinant, location, and content scores which were be- 
lieved to be highly interrelated. They were found to have a high common loading on 
one factor, especially F and D. This illustrates the problem of including such scores 
in a common matrix on the assumption that they are independent. 
EFFECTS OF DIFFERENT METHODS OF STRUCTURING THE
RORSCHACH INQUIRY UPON DETERMINANT SCORES¹


STANLEY W. KLINGENSMITH
University of Pittsburgh²


PROBLEM 


Most Rorschach workers indicate that it is essential for the examiner to avoid 
suggesting determinants to subjects during administration of the inquiry. Without 
experimental verification, it is apparently assumed that the greatest potential source 
of error lies in E’s focusing S’s attention in various ways on blot characteristics that
he would not otherwise have considered as determinants of his responses, thus biasing
his report during inquiry. The rationale may include Rapaport’s“? view that “. . .
the issue is not what is perceptible, but what is spontaneously perceived—that is, what
features of the perceptual mass initiate association processes spontaneously and in-
fluence their course.” Siipola? points out that examiners employ widely different
criteria in judging whether a given determinant should be scored. Levin“) question- 
ed whether the inquiry actually does reveal valid information regarding determin- 
ants involved in free association responses. 

In addition to the more obvious effects of differences in inquiry methods, the 
importance of the problem is further emphasized by studies which show that a var- 
iety of subtle factors may affect the number of determinants elicited. For example, 
Gibby® and Gibby and Stotsky“: » investigated the influence of type of inquiry, 
examiner differences, and ‘‘determinant shift”. The purpose of the present study was 
to investigate the changes in determinant scores related to changes in level of test 
structure. 


METHOD 
Levels of Test Structure. Four levels of test structure were employed. Structuring
refers to the instructions and aids used to guide subjects in reporting. Written
instructions were provided. A description of the levels of test structure follows.³


1. Free association. In structuring the free association, general statements which indicated 
the nature of the task were provided. Instructions similar to those recommended by Beck “) were 
used. 


2. Inquiry S1. Instructions preceding this inquiry requested description and explanation of 
responses. After responses were freely elaborated, subjects were asked: “Is there anything else 
that gives that impression?”’ or its equivalent. 

3. Inquiry S2. Instructions emphasized the need to explain in great detail everything in- 
volved in the perception of responses. Subjects were encouraged to report: ‘“What else in addition 
to the shape, if anything, helps you to see what you do?”’ 

4. Inquiry S3. In this inquiry brief definitions of determinants followed an extensive ex- 
planation of the nature of determinants and the need for detailed reporting. The printed defini- 
tions served as a check-list during administration of this inquiry. Subjects were asked to give the 
part of the definition which applied to each reported determinant, for example, which of the 
colors: red, green, black, gray, etc. was responsible for their reporting color as a determinant. 


Determinants. Only the presence or absence of the following determinants was
considered, independent of their combination with each other: chromatic color (C),
achromatic color (C’), vista (V), human movement (M), animal movement (FM),
inanimate movement (m), texture (T), and form (F). To simplify the task for subjects,
C and C’ were combined in a single definition of color, and M, FM, and m were
combined in a single definition of movement. These were separated during scoring of
the protocols, according to the card area employed, the content of the response, and,
in inquiry S3, the part of the definition given by S.

¹Adapted from a dissertation submitted in partial fulfillment of requirements for the Ph.D.
degree, University of Pittsburgh, 1956. The author wishes to express his appreciation to Dr. Roy M.
Hamlin for his guidance and counsel regarding the research on which this article is based. The
assistance of Mr. Alvis M. Eskey in determining the reliability of scoring, and Mrs. Mary G. Zima in
arranging for ay ep of the student nurses is also gratefully acknowledged.

²The author is now employed at the Mental Health Clinic, Leon County Health Department,
Tallahassee, Florida.

³Instructions employed are presented verbatim in the original work, along with discussion of
structuring used by experimenters who employed the group method of Rorschach administration.


Subjects. The experimental group comprised 40 student nurses, 20 in their second, 
and 20 in their third year of training (mean age 19.82). Base hospital affiliations 
throughout Greater Cleveland (Ohio) and several smaller communities indicate wide 
diversification of general background. A control group of 40 student nurses (mean 
age 20.05) was formed by matching for base hospital affiliation, year in training, and 
day of testing. 


Test administration. All tests were individually administered by E. The procedure 
for the experimental group was as follows. The free association was administered to 
S in the usual manner, except that only two responses per card were permitted. In- 
asmuch as all subjects could not be expected to give two responses per card, fifteen 
of the 20 possible free association responses were selected by a predetermined method 
prior to administration of the first inquiry. Then inquiries S1, S2, and S3 were given
in that order, one after the other. Efforts were made to copy responses verbatim. 

The control group was given the free association and inquiry S3 only. The purpose
of the control group was to determine the influence of practice effects and repeated
exposure to the stimulus materials, i.e., the cumulative effect of several inquiries
upon the number of determinants reported by the experimental group in inquiry S3.

Treatment of the Data. Criteria for scoring were rigorously defined on the basis of
other test records. In the free association and inquiries S1 and S2, determinants were
scored on the basis of direct or indirect evidence in subjects’ verbalizations that
determinants were used. Basic content of certain responses was regarded as sufficient
evidence. In inquiry S3, decision regarding use of determinants was left entirely to
subjects rather than to E.

The free association and inquiries S1 and S2 of 20 protocols of the experimental
group were scored independently by E and another Rorschach worker. Inter-scorer
reliability is indicated by 94 per cent agreement for all determinants, and 89 per cent
agreement for determinants other than Form. E’s scoring of the protocols was used
in tabulating results.
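
The per cent agreement figure reported above can be illustrated with a minimal sketch. The article does not give the exact computation, so this assumes the simplest reading: the share of scoring decisions (determinant present or absent) on which the two scorers coincide. The function name and the sample codes are hypothetical.

```python
def percent_agreement(scorer_a, scorer_b):
    """Percentage of scoring decisions on which two scorers agree."""
    if len(scorer_a) != len(scorer_b):
        raise ValueError("both scorers must rate the same decisions")
    if not scorer_a:
        raise ValueError("no decisions to compare")
    matches = sum(1 for a, b in zip(scorer_a, scorer_b) if a == b)
    return 100.0 * matches / len(scorer_a)


# Hypothetical presence (1) / absence (0) codes for one determinant
# across ten responses; these are not data from the study.
e_scores = [1, 0, 1, 1, 0, 1, 0, 0, 1, 1]
other_scores = [1, 0, 1, 0, 0, 1, 0, 0, 1, 1]

agreement = percent_agreement(e_scores, other_scores)
print(f"{agreement:.0f} per cent agreement")
```

Simple agreement of this kind does not correct for chance agreement, which matters most for determinants such as Form that are scored on nearly every response.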

Form was scored for most of the responses at each level of test structure. This 
determinant was therefore excluded from total score comparisons inasmuch as its 
inclusion would obscure differences among other determinants. Since distributions 
of scores for separate determinants were skewed, the nonparametric rank test and 
sign test °) were applied to test the significance of differences. The “t” test was used
in testing the significance of differences of combined determinant scores (total less
Form).
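
As an illustration of the sign test mentioned above, the following sketch computes an exact two-sided probability from paired scores. It is a generic textbook version, not necessarily the variant the author used; the function name and the sample scores are hypothetical.

```python
from math import comb


def sign_test(before, after):
    """Exact two-sided sign test on paired scores.

    Ties are dropped. Returns (number of increases, number of
    non-tied pairs, two-sided probability under a fair-coin null).
    """
    diffs = [a - b for b, a in zip(before, after) if a != b]
    n = len(diffs)
    positives = sum(1 for d in diffs if d > 0)
    k = min(positives, n - positives)
    # Probability of a split at least this lopsided in one direction.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return positives, n, min(1.0, 2 * tail)


# Hypothetical determinant counts for eight subjects at two
# successive levels of structure; these are not data from the study.
level_a = [1, 0, 2, 1, 0, 3, 1, 2]
level_b = [2, 1, 2, 3, 1, 4, 1, 3]

increases, pairs, p_value = sign_test(level_a, level_b)
print(increases, pairs, p_value)
```

Because the test uses only the direction of each change, it is unaffected by the skewness of the score distributions, which is the reason given in the text for preferring nonparametric tests for the separate determinants.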


RESULTS AND INTERPRETATION 


Table 1 shows a comparison of the experimental and control group scores in the
free association and inquiry S3. Chance factors may easily account for the differences
that occur. The data indicate that the increase in determinant scores in inquiry S3
for the experimental group is essentially independent of the effects of having taken
inquiries S1 and S2.


TABLE 1. COMPARISON OF THE SCORES OF THE EXPERIMENTAL GROUP AND THE CONTROL GROUP

                              Free Association          Inquiry S3
                                   Group                  Group
Determinants                   E     C    Diff.       E      C    Diff.

Chromatic color (C)                                  141    117
Achromatic color (C’)                                       231
Vista (V)                                                   191
Human movement (M)                                           82
Animal movement (FM)                                        214
Inanimate movement (m)                                       40
Texture (T)                                                 375
Total (Form excluded)                                      1250

Table 2 contains the scores of the experimental group for the free association
and three levels of inquiry. There is an increase in number of determinants at each
level of inquiry, except for m at inquiry S3. (This decrease in m is apparently an
artifact resulting from its being frequently overlooked in inquiry S3 because the
definition was combined with that of M and FM. The m scores are omitted in the
following comparisons and discussion.) The high level of significance obtained for
most of the comparisons indicates a sufficient degree of consistency in the data to
conclude that the trends are real. Human movement is readily elicited in the free
association, while other determinants are more readily elicited with varying degrees
of increased structuring.


TABLE 2. DETERMINANT SCORES OF THE EXPERIMENTAL GROUP AT EACH LEVEL OF TEST STRUCTURE

                               Free                Inquiry
Determinants                Association      S1       S2       S3

Chromatic color (C)              30          87**    114*
Achromatic color (C’)                        68**    129**    257**
Vista (V)                                    70       92*     196**
Human movement (M)                           92       96      105
Animal movement (FM)                        131**    163**    220**
Inanimate movement (m)                       52*      62       43
Texture (T)                                  79**    132**    360**
Total                           302         579**    791**   1322**

*Significant at the five per cent level.
**Significant at the one per cent level.
Note: Significance values are for the increase in scores over the scores at the previous levels.


Table 3 shows the pattern of scores more clearly. Using the number of determinants
in inquiry S3 to represent 100 per cent, Table 3 contains the percentages of
occurrence at each level of test structure. On the average, only 23 per cent of the
determinants were elicited in the free association. This is understandable inasmuch
as there was no request for elaboration of responses. That only an additional 21 per
cent of the determinants were elicited in inquiry S1, when there was a request for
elaboration, indicates that subjects of about average intelligence do not readily
communicate information regarding characteristics of Rorschach cards that account for
their responses. Introspection with the aid of definitions provided in inquiry S3
resulted in a marked increase in the number of determinants reported; 40 per cent of
the total number were elicited in this inquiry.
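
The percentages quoted above follow from the Total row of Table 2 as printed (302, 579, 791, and 1322). A short sketch of the arithmetic, taking the inquiry S3 total as 100 per cent and crediting each level with the determinants it added over the preceding level; the function and variable names are illustrative only.

```python
# Cumulative totals from the Total row of Table 2.
cumulative_totals = {
    "Free association": 302,
    "Inquiry S1": 579,
    "Inquiry S2": 791,
    "Inquiry S3": 1322,
}


def incremental_percentages(cumulative):
    """Share of the final total first elicited at each level, in per cent."""
    final = list(cumulative.values())[-1]
    shares = {}
    previous = 0
    for level, total in cumulative.items():
        shares[level] = round(100 * (total - previous) / final)
        previous = total
    return shares


percentages = incremental_percentages(cumulative_totals)
for level, share in percentages.items():
    print(f"{level}: {share} per cent")
```

This reproduces the 23, 21, and 40 per cent figures given in the text and leaves 16 per cent for inquiry S2.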


TABLE 3. COMPARISON OF THE SCORES OF THE EXPERIMENTAL GROUP, EXPRESSED AS PERCENTAGES
OF S3 SCORES “OCCURRING” AT EACH LEVEL FOR EACH DETERMINANT

                            Determinants
Level of Structure           C’       V

Free Association             03       22
Inquiry S1                   23       14
Inquiry S2                   24       11
Inquiry S3                   50       53
Total percentages           100      100

*Inanimate movement (m) not included because percentage values are misleading.


It may be noted in Table 3 that the pattern of scores for C’, V, and T is different
from that of other determinants. Only three per cent of the C’ determinants were
given in the free association as compared with 78 per cent of the M determinants.
Fifty per cent of the C’, V, and T determinants were not reported until inquiry S3.
Human movement is readily elicited, while FM and C fall between M and the shading
responses (C’, V, and T) in this respect. Study of the protocols showed that
the high proportion of M determinants elicited in the free association is not an
artifact related to criteria employed in scoring.


DISCUSSION


The present study indicates that the number and pattern of determinants
elicited may vary greatly with different methods of inquiry. This finding has
important implications regarding reliability, validity, and standardization of the
Rorschach. For example, the fact that many determinants are not readily elicited
permits the influence of extrinsic factors such as set of the subject and examiner
differences, probably to a greater degree than is generally realized. A need for
reevaluation of the great concern with the danger of “suggesting” determinants is
indicated. Evidence in this study that exposure of the experimental group to inquiries
S1 and S2 did not affect the results in inquiry S3 suggests that we are dealing with a
relatively stable phenomenon. It is quite possible that perceptions are sufficiently
“set” at the moment of closure that direct questioning regarding determinants will
not substantially alter the perceptions or bias the report of S.


SUMMARY 


The effect of different methods of Rorschach inquiry upon number of determ- 
inants scored was tested. The free association was considered as one level of struc- 
turing. The three inquiries varied in degree of structuring from one which provided 
little instruction or pressure for elaboration of responses to one which provided ex- 
planation and definition of determinants. The following results were obtained: 


1. There is a corresponding increase in the number of determinants elicited as 
the directions to subjects for reporting determinants are more detailed and specific. 
Even after having been administered the free association and two separate inquiries, 
in the third highly structured inquiry there was a marked increase in the number of 
determinants elicited, excluding form—40 per cent of the total. 


2. The pattern of determinant scores varies greatly according to the level of 
test structure. Human movement is readily elicited, even in the free association. The 
shading determinants are not readily elicited in the free association or in inquiries 
with little instruction or probing. 


3. The increase in number of determinants elicited when definitions were pro- 
vided was not a function of the experience of two previously-administered inquiries 
which did not contain definitions of determinants. 
PSYCHOMETRIC SCALES FOR THE RORSCHACH POPULAR RESPONSE


PROBLEM


The “popular” response, the most quantifiable of all Rorschach variables, has
received little attention from experimenters. Hertz) gathered in 1938 a list of 
‘“‘populars”’ agreed upon by three or more scorers, among whom were Beck, Loosli- 
Usteri, Gardner, and Vernon. Not until 1950 when Beck’s Spiegel study °’ appeared, 
were there any more than brief references to this variable. Then in 1951 Molish’s 
article on the popular response in records of normals, neurotics, and schizophren- 
ics®) and in 1952 Friedman’s study“ were published. 

The concept of popularity or communality“?: “ has been interpreted as re- 
flecting ‘‘ability to adjust to the practical requirements of life, to a sufficient partici- 
pation in collective or common thinking’’ (Oberholzer, Piotrowski) ; ‘“conformity in 
thinking processes’’; ‘‘ability to adapt one’s thinking to that of the common man, 
ability to experience communal thinking’’ (Molish) ; “‘F plus at its most concentrat- 
ed’’, “‘cooperativeness, ability to participate in common thinking or wish, hence 
adaptability’ (Beck). Recent studies of normals and schizophrenics in which P is 
shown to be a differentiating factor®: '°’ support somewhat these interpretations. 

Antecedent, however, to any interpretation is the need for precise definition of 
the P factor which can be established objectively only on a statistical basis. A res- 
ponse thus may be defined as “popular” according to the frequency with which it 
occurs in a normal population. P responses are simply those most frequently given 
to a specific area of a blot. Rorschach suggested a criterion of “once in three re- 
cords’, and Oberholzer also used this criterion. Others have used a criterion of once 
in six, particularly for children’s Rorschachs. Other factors, such as the size of the
sample upon which frequencies are based, IQ, socio-economic level, type of culture,
etc., of course, need to be controlled. If, as Beck says, the P factor provides “the
most satisfying opportunity for quantitative method in Rorschach test procedure”
(2) it should certainly be quantifiable by way of the rating scale technique, and this
we have attempted to do.¹


PROCEDURE 


For the development of such a scale it is necessary only to assume that the P 
factor applies to the continuum which ranges from originality on the one hand to 
extreme communality, approaching universality, on the other. To provide an ob- 
jective measure of this continuum, the author, in 1948, compiled a four-point psycho- 
metric scale from the lists of “popular’’ responses published by the three chief 
authorities in the field: Beck, Hertz, and Klopfer. Beck’s later study of the Spiegel
Mail Order House employees and Molish’s comparison of their records with those of 
neurotics and schizophrenics provide additional data for a scale of normal populars. 
The scales presented in this article are a revision which incorporates these additional 
data. 

In Beck’s earlier list, as in those of Hertz and of Klopfer, the criterion was that 
of frequency, although the statistical basis was not elaborated upon. In fact, Hertz 
was the only one who presented data to substantiate the criterion of once in three 
records. The items in Klopfer’s“> list, admittedly ‘‘a selected sample’’, were quite 
vaguely defined, for example, as “any creature with body in center and wings” for
the W of Card I, and “any winged creature” for Card V. 

¹Developed under the inspiration of Joseph Zubin, in whose Manual of Projective and Cognate
Techniques?) the author’s first version of the scale appeared.

In Beck’s recent study, however, he uses three criteria as follows: 1. Operation
Stimulus: the percentage of subjects selecting the area and giving this response must
be three times or more as great as for the next most frequent response; 2. Operation
Group: fifteen per cent of the group must give the P response to this area; 3. Any
area which is P must constitute a minimum of six per cent of the selections in its test
figure.
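
The three criteria attributed to Beck above can be expressed as a simple check. This is a sketch under stated assumptions, not Beck’s own procedure: all quantities are taken as proportions between 0 and 1, the third criterion is read as the area’s share of all selections on its card, and the function and parameter names are hypothetical.

```python
def meets_beck_criteria(response_share, next_response_share,
                        group_share, area_selection_share):
    """Return True if a response passes all three criteria.

    response_share: proportion of subjects selecting the area and
        giving this response.
    next_response_share: the same proportion for the next most
        frequent response.
    group_share: proportion of the group giving the response to
        this area.
    area_selection_share: the area's share of all selections made
        on its card (an assumed reading of the third criterion).
    """
    operation_stimulus = response_share >= 3 * next_response_share
    operation_group = group_share >= 0.15
    selection_minimum = area_selection_share >= 0.06
    return operation_stimulus and operation_group and selection_minimum


# Hypothetical proportions, not taken from the Spiegel sample.
passes = meets_beck_criteria(0.30, 0.08, 0.30, 0.10)
fails = meets_beck_criteria(0.30, 0.12, 0.30, 0.10)
print(passes, fails)
```

In the second call the leading response is only two and a half times as frequent as its nearest competitor, so the first criterion fails even though the other two are met.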

With these more rigid (though still rather arbitrary) criteria the resulting list
differs in several respects from the list published in Beck’s 1944 text. There are five
omissions, two of areas and three of responses to areas retained as popular, i.e., D6
Card IV, “foot or shoe”, W Card VII “clouds”, “butterfly”, “moth” for D3 Card
III, and “octopus” for D1 Card X. There are four additional areas, besides two
changes in the interpretation of areas, i.e., D4 “human figure” and the interpretation
of D3 in Card I as “a person’s legs”; D4, “tree”, “bush”; D1 interpreted now as
“rodent, rat, mice, squirrel” as well as “bears”, “beavers”, “muskrats” in Card
VIII; D2 “dog”, D12 “sheep” in Card X. The total number of areas is actually
increased from 20 to 23 and the number of concepts involved is increased to 82.²

Klopfer finds popular responses in ten areas on six cards, involving 25 different
concepts. Hertz finds populars in 24 areas on nine cards, involving 64 concepts. 
Molish’s results yield eleven areas and 18 concepts. 

The normals in Molish’s study were the carefully selected Spiegel group which 
represented the middle two-thirds of the general population in intelligence, age, and 
socio-economic level. Chi-square values were run to sift out the responses which
occurred with significantly greater frequency among normals than among neurotics 
or schizophrenics. The areas and concepts selected by normals with significantly 
greater frequency than by neurotics or schizophrenics are considered essentially 
normal responses and are utilized in the present scale through Ratings Two to Five. 

The following steps were selected as the basis of the scale. The intervals be- 
tween steps are not assumed to be equal. 

0. Original, not appearing in any published frequency table: Beck revised list
(B), Hertz (H), Klopfer (K), Molish (M).

1. Tending towards originality; occurring only once in 100 records. Listed by
Hertz as original. (See samples.)

2. Common but not popular. Listed in one only of the published frequency
lists. Tendency toward popularity.

3. Quite popular. Listed in two only of the published frequency lists.

4. Very popular. Listed in three of the four published frequency lists.

5. Extremely popular, almost universal. Listed in the three well known
published lists, also found by Molish to occur with significantly greater frequency
in normals than with schizophrenics or neurotics.
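
The rating steps above amount to a lookup on which of the four lists contain a response. The following sketch encodes that rule using the list abbreviations from step 0 (B, H, K, M). The function name and the `hertz_original` flag are hypothetical, and Rating 5 is read as requiring all four lists.

```python
ALL_LISTS = {"B", "H", "K", "M"}  # Beck, Hertz, Klopfer, Molish


def popularity_rating(listed_in, hertz_original=False):
    """Return the 0-5 popularity rating for one response.

    listed_in: the lists (by abbreviation) that give the response
        as popular.
    hertz_original: True if Hertz lists the response as original,
        which is taken here as the basis for Rating 1.
    """
    sources = set(listed_in)
    unknown = sources - ALL_LISTS
    if unknown:
        raise ValueError(f"unknown list abbreviations: {sorted(unknown)}")
    if sources == ALL_LISTS:
        return 5  # the three well known lists plus Molish
    if len(sources) == 3:
        return 4
    if len(sources) == 2:
        return 3
    if len(sources) == 1:
        return 2
    return 1 if hertz_original else 0


print(popularity_rating({"B", "H", "K", "M"}))  # all four lists
print(popularity_rating({"B", "M"}))            # two lists only
```

A lookup of this kind treats the lists as equally weighted, in keeping with the statement that the intervals between steps are not assumed to be equal.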


FIG. 1. POPULARITY SCALES: GUIDE TO THE PSYCHOMETRIC SCORING OF THE POPULAR RESPONSE
IN RORSCHACH RECORDS OF ADOLESCENTS AND ADULTS





Rating 0. Original: not found in any frequency table. 





Rating 1. Tending towards originality. 
CARD LOCATION (Beck) RESPONSE
I W human pyramid made up of acrobats
North wind with his mouth open and blowing 
nightmare that you see in dreams 
bug in center, crawling into torn leaf 
mountain with reflection in water below 
Russian Cossack with bushy beard 
tooth because of its roots 
insect flattened out as if stepped on 
cross section of the brain 
D4 elves sitting back to back, cone shaped hats on 
Vv Dd22 man in the mountain 


²Beck states that the number of P associations resulting from this study is 21. Nevertheless, a
count reveals 23 different locations. He includes one which he admits does not satisfy his first criterion,
but which is very close and was in the old list, i.e., “butterfly”, “moth” D3 Card II.
III V W

V 
Dd33 


animal, hands up as if trying to grab butterfly,
tattooed on belly
cave with butterfly in middle 
hat with feather or puff on side of it 
Indian buffalo god 
king surrounded by animals 
person’s arms, of a swimmer 
—— design 
hammock with two people lying in it
a fur cape 
book ends 
two persons 
cobras about to strike 
trumpeters with horns, back to back
snake’s head and reflection 
chimney with rays around it 
frog, bisected 
marionettes dancing with big heads on them 
dog balancing something on his nose 
forest with lighter landscape in back 
decorated bowl 
wood carving with animals 
leaf with parts torn out 
ornament for a tree 
boat with sails 
woman gazing into mirror seeing reflection 
woman in costume of the 90’s 
hat, jacket, dress on a rack 
person’s hand, thumb up, pointing
person’s funny profile as in a cartoon 
wind as seen in fairy books, flying 
tapestry 
wall paper 
fancy monogram 
mummy
child’s head 





Rating 2. Common but not popular. 
CARD LOCATION RESPONSE

D3

face (H) 

any creature with body in center and wings (not 
rated elsewhere) (K) 

legs, lower part of body, person’s legs and back, 
child’s legs (B) 

a _ CB) 
~~ a fpr 


aaa du) 
dancers, clowns, jitterbugs (B) 
man, lady (H)
rabbit, bull, rhinoceros (K) 
butterfly, moth (B) 
dressed up animals, two human figures in a bending 
position (K), persons doing something (H) 
characters, caricatures, dancers, jitterbugs, s “7, | 
figures, marionettes, humans, people, waiters (B) 
bow (B), moth (H)
bat (H), ape, gorilla, bearskin rug, large furry
animal (B)
foot, boot, shoe (H)
any winged creature (not rated elsewhere) (K)
leg (animal or human) (B) 
Persons (H) 
Person (H) 
butterfly, moth (H) 
any kind of four legged animal in motion (K),
rodent, rat, squirrel, beaver, muskrat (B)
butterfly (H)
spine, fish bone (H), bone structure, vertebra 
skeleton (B) 
bush (B) 
D1 person bending over doing something (H)

D2 animal’s face or head (H)

D3 persons, clowns, ghosts, gremlins, witches, myth- 
ological creatures (B) 

D9 tree (H) 

D1 crustacean (B) 

D4 garden snake, tobacco worm, eel (K), serpents (H), 
rabbit’s face (H), head of an animal with long 
ears or horns, as of a donkey or goat (K) 





Rating 3. Quite popular.
CARD LOCATION RESPONSE
W bird (H, K)
W or side


humans, a” (B, H)
animal (
butterfly (H, K), bowtie, hair ribbon (B, K)
animal oe, hide, fur, pelt (B, H)
bird (H, K)
am rug at, (B, H)
face or head of person (B, H)
bears, mice (B, M)
ribs, backbone (B, H)
tree (B, H)
human figure (B, M)

person’s face (B, H)

lobster (B, M), octopus (H, K)
ia (including special breeds) (B, M)
caterpillars, worms, seahorses (H, K)
rabbit (B,
sheep (B, M)





Rating 4. Very popular.
CARD LOCATION RESPONSE
V W bat, butterfly, moth (B, H, K)
IX D4 human head (B, H, M)
X D5 rabbit’s head (B, H, K)


Rating 5. Extremely popular, almost universal.
CARD LOCATION RESPONSE

I W bat, butterfly, moth
II D1 dog, bear
III D1 persons
VI W or D1 animal skin
VIII D1 four-legged animals
X D1 crab, spider








DISCUSSION


It should be noted that “response”’, as referred to in this scale of the popularity 
continuum, refers not merely to location. Neither is it solely a matter of the words 
used. Since different meanings are denoted by different words, all alternative verbal- 
izations listed by the different authors are counted as different responses unless the 
words are essentially identical in meaning, e.g., “persons” or “humans”. This would
seem to be the only accurate method of compilation, since some authors, (Beck, for 
instance) list many similar concepts as separate populars and yet expressly mention 
other similar ones which are not to be considered popular. 

It is interesting to note that the most universal “popular” is still “bat”, Card
I W. In Hertz’s 1938 article, she reported three areas and eight concepts upon which
all scorers agreed. Besides the “bat” of Card I, the W of Card III, interpreted as
“persons, men, or clowns”, and the W of Card VI, interpreted as “animal, skin, hide,
fur, pelt”, were reported by all investigators. These responses continue to rate high
on the scale, as is seen in Rating 5. The W of Card V, “bat”, is also highly popular,
although not appearing in the Molish list. However, it is surpassed in frequency by
the “bat” of Card I.
According to Ranzoni"), Cards I, V, VI, VIII, and X commonly yield animal
responses and Cards III and VII, human responses. The four lists used as a basis for 
our scale give animals for Card II also, while Hertz lists an animal or insect response 
for every card. Beck lists no animal response for Cards III, VII, or IX; Klopfer, none 
for Cards IV, VII, or IX; Molish, none for Cards III, IV, V, VII, or IX. Hertz and 
Beck list human responses for Cards III and VII, as in the Ranzoni study, and also 
for Cards I and II. Klopfer finds a human response popular only in Card III. Molish 
lists human populars in Cards I, II, and IX. 

Bochner and Halpern quote a “‘list of the usually accepted popular answers” 
(8, p. 18) which appears to be based upon Klopfer’s list with a few Beck populars 
added. It covers 11 areas and gives 16 different concepts. In another section of the 
book the authors refer to 18 popular responses in all - ®), but this writer counts 
only 16. No P are listed for Cards VII and IX. No statistical basis is given and no 
explanation for the inclusion or exclusion of any item or area. 

McCall and Doleys®? listed no specific popular responses, but on the basis of 
further analysis of Beck’s Spiegel sample, constructed tables of rough norms for 
popularity saturation at various response levels. It is likely that psychometric 
ratings of P will also ultimately have to be corrected for varying levels of R. 


SUMMARY 


A five step psychometric scale, based upon the lists of populars published by 
Beck, Hertz, Klopfer, and Molish, is presented as a tool for more refined, objective 
scoring of the P factor in Rorschach records of adolescents and adults. The present 
scale attempts to achieve greater objectivity and refinement which can be further 
tested both clinically and experimentally. 
SELF CONGRUENCE, BY SEX AND BETWEEN THE SEXES, IN A
“NORMAL” POPULATION¹


PROBLEM


The purpose of this study was to investigate for two unselected samples of 
presumably ‘“‘normal’’ men and women, the hypothesis that positive, significant 
intrasex and intersex relationships would be found among three specific measures of 
the self. These measures were in the form of self ratings and were designed to assess 
a self-Ideal (Scale sI), an Actual self (Scale As) and the Social self (Scale Ss). The 
expectations of overall and specific positive relationships were made on the basis of 
theory, empirical data“: ?: * *-®), and on parsimony. Specific separate analyses were 
computed for each sex sample and between the sexes on each self-rating scale. The
obtained data may be seen in the context of self congruence theory as it applies to 
men, to women and to the relationships between the sexes. 


PROCEDURE 


Subjects. Ss were members of a pool of 76 male and 66 female, presumably well- 
adjusted, undergraduate volunteers who were obtained individually and in groups 
from university dormitories and from fraternities. From this larger pool, 40 men and 
40 women were selected so that Ss in each sex sample (Table 1) would be similar in 
age, class distribution and grade point, to the opposite sex. Although comparable, 
the samples were not representative of the university population. Data for this 
selection procedure were obtained from a personal data blank completed by each S. 


TABLE 1. CHARACTERISTICS OF THE SAMPLE

                           Men              Women
Factors                  M      σ         M      σ

Age                    19.3   2.05      19.4   1.41
Class Distribution     Sex samples nearly identical.
Grade Point            2.63    .53      2.78    .46








Method. Ss met in small groups and were told that E was investigating and trying to
understand the general attitudes of students in order to be of help to them. Their
cooperation and attention to instructions were encouraged. Each S was given a
folder which contained three forms of the self-rating scale (see below), a personal
data blank and definitions of the twenty-six traits³ to be rank-ordered. Two anchoring
definitions were provided, indicating the extremes of each trait continuum which
was to be utilized in making the self-ratings. For example, the anchoring definitions
of the trait of sociability were: “A high degree of sociability means that the individual
is friendly and interested in others and seeks contact with them; a low degree of
sociability means that the individual tends to withdraw from others and keep to
himself.” Ss were instructed to read through the trait definitions and to then make


1The authors wish to express their gratitude to Dr. Harold P. Bechtoldt for his invaluable assist- 
ance in the statistical and editorial preparation of this manuscript. Dr. Leonard D. Goodstein made 
several helpful suggestions. 

2The authors are indebted to Drs. Charles D. Spielberger and Leopold O. Walder and Miss 
Jacolyn Young who collected the data for this study. 

³Autonomy, Creativeness, Dependability, Dominance, Emotional Maturity, Flexibility, General
Success, Individuality, Initiative, Intelligence, Interest in Opposite Sex, Motivation, Overall
Adjustment, Physical Attractiveness, Popularity, Prestige, Security Minded, Self Acceptance, Sincerity,
Sociability, Self Confidence, Social Poise, Sportsmanship, Self Understanding, Trustfulness,
Understanding of Others.
the self-ratings on the three forms, following closely the specific directions given on
each form. They were told to make the self-ratings in a standard sequence and to
keep the sets of ratings independent of each other.

For the self-Ideal (Scale sI), Ss were instructed to rank the twenty-six items in
order of importance to them. Rank 1 was to be given to that trait about which they
were most concerned and on which it was most important to them personally to stand
high. A rank of 2 was to be given to the next most important trait and so on through
number twenty-six. Number twenty-six, it was then pointed out, would be the trait
on which it was least important to Ss to stand high.

The Actual self (Scale As) instructions were similar to those of Scale sI except 
that in this case, the rank of 1 was to be given to that trait which was most character- 
istic of the individual, 7.e., the trait which S felt was most typical of himself and most 
necessary for an accurate description and full picture of himself. It was explained 
that a large amount of a particular trait was not necessary in order for it to be char- 
acteristic of a person. 

For the Social self (Scale Ss), Ss were instructed to assign the first rank to that trait
which they considered most acceptable for a person in our society to have, i.e., the 
trait which most people would consider a good one to have. The additional instruc- 
tions were similar to Scales sI and As. 

The personal data blank was administered last and was used to obtain, among 
other things, Ss’ ages, academic classifications and grade point averages, which were 
utilized in the subject selection process mentioned above. Within group comparisons 
were made for each sex sample by computing all possible intercorrelations*t among 
the three scales. The item values used in these computations were mean ratings 
(N = 40) on each trait. 
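
The computation described above can be sketched as follows. This is only an illustration under stated assumptions, not the author's procedure: the toy rankings (three subjects, four traits) and the function names `pearson_r`, `mean_ratings` and `intercorrelations` are hypothetical, whereas the study used twenty-six traits and N = 40 per sex sample.

```python
from itertools import combinations
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def mean_ratings(rankings):
    """Mean rank given to each trait, averaged over subjects."""
    n = len(rankings)
    return [sum(col) / n for col in zip(*rankings)]


def intercorrelations(item_values):
    """Pearson r for every pair of scales, keyed by (scale, scale)."""
    return {
        (a, b): pearson_r(item_values[a], item_values[b])
        for a, b in combinations(item_values, 2)
    }


# Hypothetical toy data: each inner list is one subject's ranking of 4 traits.
toy = {
    "sI": [[1, 2, 3, 4], [2, 1, 3, 4], [1, 3, 2, 4]],
    "As": [[1, 2, 4, 3], [2, 1, 3, 4], [1, 2, 3, 4]],
    "Ss": [[4, 3, 2, 1], [3, 4, 1, 2], [4, 2, 3, 1]],
}
item_values = {scale: mean_ratings(r) for scale, r in toy.items()}
table = intercorrelations(item_values)
for pair, r in table.items():
    print(pair, round(r, 2))
```

Because the item values are means over subjects, each coefficient here rests on as many pairs as there are traits, which is consistent with the N = 26 reported for each cell of the correlation tables.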


RESULTS AND DISCUSSION 


It can be seen in Table 2 that high, positive and significant intercorrelations 
exist among the three scales for men, indicating that these Ss have similar concep- 
tions of self as defined. Such an interpretation must be based on the assumption 


TABLE 2. WITHIN GROUP CORRELATIONS BETWEEN EACH PAIR OF SCALES
FOR THE TWO SEXES AND THE DIFFERENCES BETWEEN CORRELATIONS
(N = 26 FOR EACH CELL)

Scales | Men: r, Diff. | Women: r, Diff.
sI-As
sI-Ss
As-Ss

[Cell entries are not legible in the source; the surviving fragments are 15, ll, .35**, 04, and 08.]

** p < .01   * p < .05


that Ss did in reality react differentially to the instructions. The fact that the differ- 
ences between these correlations are not significant supports the aforementioned 
finding that there exists a high level of agreement among the three measures of the 
self, and supports the theoretical argument that “normal” adjustment and congruent
self conceptions are positively related. It confirms, with a group of older Ss, the
positive relationship between the actual self concept and the ideal self concept
reported by Hanlon et al. for a group of high school students.

The intercorrelations among the three scales for the women are all positive and 
significant. Reference to Table 2 shows that the sI-As correlation is significantly 


⁴In all cases, the correlations were Pearson r’s.







larger than the correlations for sI-Ss and As-Ss. With these female subjects, the 
introduction of the Social self scale significantly lowers the levels of agreement or 
congruence among these measures of the self⁵. This obtained lack of congruence for
the female Ss implies that these women see themselves as being and behaving in a 
manner different from what they believe society expects of them. Both their hopes 
about themselves and their actual characteristics seem to be at variance with 
their understanding of social expectations. In dynamic terms, one may infer con- 
flict between personal strivings and either present social standards or introjected 
values. 

Table 3 shows the intersex correlations for each of the three scales. All of the 
coefficients are high, positive and significant, suggesting a high level of agreement 
between the men and the women under this set of instructions. The inference is 
made that the sexes have highly similar concepts of self, ideally, realistically, and
socially. Again, reference to Table 3 shows that the Social self (Ss) correlation of .85 

TABLE 3. CORRELATIONS BETWEEN MEN AND WOMEN FOR EACH SCALE
AND THE DIFFERENCES BETWEEN THESE CORRELATIONS
(N = 26 FOR EACH CELL)

Scales | r | Diff.
sI-sI | .93** |
As-As | |
Ss-Ss | .85 |

Diff. entries surviving in the source: .06, .02, .08*

** p < .01   * p < .05


is significantly lower than the self-Ideal (sI) correlation of .93, meaning that these 
men and women are more alike in terms of their self-Ideal than in terms of their 
respective social expectations. Such a finding may be attributed to different stand- 
ards set for each sex. 

In this context, although intersex comparisons between the correlations on each 
pair of scales (Table 2) show substantial absolute differences in two out of three 
cases (Diff. sI-Ss, m-w = .21; Diff. As-Ss, m-w = .25), none of these differences is
significant. The direction of the differences nevertheless suggests that women show lower
self congruence than do men, perhaps because these college women have greater social
pressure to conform and are consequently less independent.

In order to ascertain the degree of subject agreement in the trait rankings, co- 
efficients of concordance (W) were computed. The results of this comparison for 
the men, shown in Table 4, indicate that male Ss ranked the traits similarly on the 
three scales. However, this level of agreement (.82) is taken to mean that while 
there was some consistency in the rankings, the coefficients were not so high as to 
preclude differential effects of the instructions which defined each scale. For the 


TABLE 4. W VALUES FOR SCALES sI, As, AND Ss, AS A GROUP AND
INDIVIDUALLY FOR THE TWO SEXES

W Coefficients**
Scales | Men | Women
sI, As, Ss | .82 | .73
sI | |
As | |
Ss | |

** p < .01 in each case


⁵An additional analysis was made to ascertain which traits seemed to account for the significantly
lower correlations (sI-Ss; As-Ss) for the women. When these traits were identified, their meaning 
seemed too confused and ambiguous to report here. 
women, too, the order of the W coefficient (.73) allows for rejection of the hypothesis
of parallel forms. 
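
For readers unfamiliar with the statistic, the coefficient of concordance can be sketched as below. The sketch assumes complete rankings without ties and the usual formula W = 12S / [m²(n³ − n)], where m is the number of rankings and n the number of items; the source does not say how ties were handled, and the function name `kendalls_w` and the example rankings are hypothetical.

```python
def kendalls_w(rankings):
    """Kendall's coefficient of concordance for m untied rankings of n items.

    rankings: list of m lists, each a permutation of the ranks 1..n.
    Returns a value between 0 (no agreement) and 1 (identical rankings).
    """
    m = len(rankings)
    n = len(rankings[0])
    totals = [sum(col) for col in zip(*rankings)]      # rank sum per item
    mean_total = m * (n + 1) / 2                        # expected rank sum
    s = sum((t - mean_total) ** 2 for t in totals)      # squared deviations
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Hypothetical example: three rankings of four traits.
judges = [[1, 2, 3, 4], [1, 3, 2, 4], [2, 1, 3, 4]]
w = kendalls_w(judges)
print(round(w, 3))
```

Applied to the three scales as a group, m would be 3 and n would be 26; applied to a single scale, m would be the number of subjects ranking the traits.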

The coefficients obtained for each scale, for each sex independently, are low, 
although significantly different from zero, indicating considerable variability in the 
values assigned to any one trait. The women were consistently less variable than the 
men in their ranking of the traits on all three scales individually. 


SUMMARY 


On the basis of theory and empirical studies, it was expected that positive intra- 
sex and intersex relationships would be found among three specific measures of the 
self in a presumably “normal” population of men and women.

Three scales, defined by their instructions as a self-Ideal, an Actual self and a 
Social self, were administered to unselected male and female samples of a university 
population. In general, the results showed high and significant correlations among 
the three scales for the men, meaning that these Ss, assumed to be normal, do have, 
as expected, similar or congruent conceptions of self. For the women, the correla- 
tions are somewhat lower although still significant. It was found that the Social 
self scale introduced a factor which significantly lowered the levels of agreement 
between it and the other two scales for the women. This finding was interpreted to 
mean that these female Ss see themselves as being and behaving in a manner different
from what they believe society expects of them. This discrepancy was characterized
as a conflict based on either “reality” or “distortion”.

On the basis of the obtained high, positive and significant intersex correlations, 
it was inferred that the sexes have highly similar concepts of self, ideally, realistically, 
and socially. Self concept theory may only be applicable to within group congruence 
or discrepancy. These findings may be seen as some support for the self theory where 
it concerns men. There is doubt cast on either the specific adjustment level of these 
women or on the accuracy of the theory. 

SIMILARITY OF PERSONALITY IN THE SOCIOMETRIC PREFERENCES
OF MENTAL PATIENTS¹


WILLIAM G. SHIPMAN 
University of Pittsburgh Medical School 


PROBLEM 

The concept of measuring a person’s strong, inner drives by the study of his likes
and dislikes for others known to have these same trends was used as the basis of the
Szondi Test, but attempts to validate this principle using the test’s unclear pictures
of mental patients in outmoded dress have been generally unsuccessful. This
study attempted to bypass the problem of pictorial representation by using actual
people with mental disorders: patients on a ward of a mental hospital. Not only
were they the people being chosen but they were the choosers. Despite early notions
to the contrary, psychotic patients have recently been shown to have measurable
feelings for each other, and there is a current growing awareness of and interest
in the social structure among the mental patients on a mental hospital ward.

As in many of these previous ward studies, the sociometric technique developed
by Moreno was used in the present study to discover whether the choices of these
people with extremes of personality organization were significantly related to their 
own personality type. The hypothesis to be tested is that people of a given person- 
ality structure will tend to prefer as friends people of a similar personality structure. 


METHOD 

Subjects. The subjects were 59 mental patients. All of them were on one mental 
hospital ward of 115 male patients of fairly good adjustment. The remainder of the 
ward population was not tested because the patients were either too sick to cooperate 
or unreachable due to a tight schedule of work and recreational activities. Of the 59 
cooperating subjects, 22 were considered mainly paranoid, 18 were chiefly schizoid, 7 
were sociopathic, 4 were emotionally unstable, and 8 were unclassifiable. Twenty 
were rated as being in good psychological adjustment, 27 as poor, and 12 as very 
poor. The mean age was 43 and the mean education was tenth grade. There were 
no statistically significant differences between the paranoid and schizoid groups for
age, education or rated psychological adjustment. 


Procedure. Each patient was seen by E in a private room, reassured about the 
confidentiality of the material, then asked: “Suppose you have been given a free trip
around the world, and you can take one of the men from the ward with you. Which 
one would you choose?” The patient’s answer and explanation for it were followed 
by a similar inquiry about his second and third choices. The case folders were then 
consulted for the patient’s age and education. The ward psychiatrist, who knew all 
the men well, estimated their level of psychological adjustment (good-poor-very 
poor) and classified their personality structure according to the following scheme 
which is drawn from the A. P. A. Diagnostic and Statistical Manual (pp. 4-38):

1. Paranoid      4. Inadequate             7. Cyclothymic
2. Schizoid      5. Sociopathic            8. Unclassifiable
3. Compulsive    6. Emotionally unstable


The paranoid and schizoid categories were used to help differentiate among the 
paranoid schizophrenics who were either predominately schizoid or predominately 
paranoid. 

RESULTS AND DISCUSSION


The hypothesis that people prefer others of a similar character structure was 
tested in two comparisons because the ward population was comprised predominately 


¹The author wishes to thank Doctors Schilling, Greizman, and Lane, as well as the patients at
Torrance (Penna.) State Hospital, for their cooperation in this study. The writer was on the hospital
staff when this experiment was done.
of two groups, 45 paranoids and 37 schizoids. Comparisons of the degree to which
these two groups tended to prefer others of a similar personality structure are pre- 
sented in Tables 1 and 2. Table 1 reveals a much stronger than chance tendency for 


TABLE 1. EXPECTED AND OBTAINED PREFERENCES OF PARANOIDS

Item | Paranoid | Schizoid | Others | Total
Frequency of patients in ward | 45.0 | 37.0 | 33.0 |
Expected chance ratio of patients’ total choices** | [not legible]
Obtained total choices by patients* | 36.0 [column position and remaining cells not legible]
Expected chance ratio of patients’ first choices | [not legible]
Obtained first choices by patients† | 13.0 [column position and remaining cells not legible]

*χ² is 29.42, p < .01.
†Cell frequencies are too small to calculate χ².
**Since a patient could not choose himself the expected frequencies for the paranoids
are N - 1.



the paranoids to prefer other paranoids while Table 2 shows the preferences of the 
schizoids to be distributed in an apparently chance manner. The rationale of the 
Szondi Test states that the category chosen indicates the extent of the particular 
need tension. Since there is a markedly similar need tension system in all patients 
of the same personality structure and a generally similar degree of acceptance of it 
there should be strong mutual preferences. The behavior of the paranoids seems to 
fit this theory but the behavior of the schizoids seems a serious challenge to it. 


TABLE 2. EXPECTED AND OBTAINED PREFERENCES OF SCHIZOIDS

Item | Schizoid | Paranoid | Others | Total
Frequency of patients in ward | 37.0 | 45.0 | 33.0 | 115.0
Expected chance ratio of patients’ total choices | 11.7** | 14.6 | 10.7 | 37.0
Obtained total choices by patients* | 12.0 | 15.0 | | 37.0
Expected chance ratio of patients’ first choices | 5.7 | | 5.4 | 18.0
Obtained first choices by patients† | 6.0 | 9.0 | | 18.0

*χ² is .0078, p > .99.
†Cell frequencies are too small to calculate χ².
**Since a patient could not choose himself the expected frequencies for the schizoids
are N - 1.
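
The chance expectations in the table can be checked with a short sketch. It assumes that each choice is equally likely to fall on any of the other 114 men on the ward, so that the chooser's own category is counted as N - 1; the function name `expected_choices` is hypothetical, while the ward counts and the total of 37 choices are taken from Table 2.

```python
def expected_choices(ward_counts, chooser_group, n_choices):
    """Chance-expected number of choices falling in each category.

    A patient cannot choose himself, so the chooser's own category
    contributes N - 1 available men instead of N.
    """
    available = dict(ward_counts)
    available[chooser_group] -= 1
    pool = sum(available.values())
    return {group: n_choices * k / pool for group, k in available.items()}


ward = {"Schizoid": 37, "Paranoid": 45, "Others": 33}
expected = expected_choices(ward, "Schizoid", 37)
for group, value in expected.items():
    print(group, round(value, 1))
```

Under these assumptions the rounded values agree with the expected row for total choices (11.7, 14.6 and 10.7).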


Apparently the paranoid patients responded positively to themselves in others 
while the schizoid patients did not even react negatively to it. This suggests that the 
paranoids were more personally involved in their social milieu than were the schizoids.
This is consistent with general concepts about the two conditions. A
schizoid commonly avoids close relations with others and tends toward autistic thinking,
while paranoids are characterized by marked sensitivity in interpersonal relations
and extensive use of the projection mechanism. The restitutional nature of
paranoid symptoms is considered to keep the patient functioning well while his
thinking is psychotic about certain specific matters.
Most experts consulted about this project expected the paranoids to dislike each
other because they would sense and avoid each other’s suspiciousness and unconscious
homosexual impulses. Psychological research also suggests this in
that Cohen found people who project to be the most mutually antagonistic of all
the groups he studied. Perhaps this strong mutual preference of the paranoids was 
facilitated by these unconscious factors as well as by a possible readiness to believe 
and support each other’s delusional character defenses. 

The apparently random preferences of the schizoids raise the question of
whether they did, indeed, choose on a random basis or whether it was done in some 
personally meaningful way that was independent of perception of self in others. In 
order to answer this question the preferences of the two patient groups were studied 
to see whether the factors of similarity of age and ward location were operative in a 
differential manner. It was thought that they would operate more strongly on super- 
ficial choices than on choices determined by more personal factors. The obtained 
Pearson correlations are all small and statistically insignificant except for one for the 
paranoids (r = .29) which indicates a mild tendency for them to choose as friends 
people living near their quarters on the hospital ward. 

The reasons given by the patients for their preferences were searched for clues
as to the basis of the schizoids’ selections. Aside from the greater number of reasons
and the greater specificity in the paranoids’ answers there were no differences between
the groups. These further checks do not seem to help answer the question of
what determinants led to the choices of the schizoids. All that we can conclude is 
that the choices were made in an apparently random manner despite the fact that 
the schizoids cooperated with the study and gave meaningful but undifferentiated 
reasons for their choices. 


SUMMARY AND CONCLUSIONS 


An attempt was made to test the tenet of Szondi Test theory that claims mutual 
preferences will appear amongst people of similar personality structure. This was 
done by studying the sociometric preferences of 38 mental patients, 20 of whom were 
considered to have a paranoid personality structure and 18 who were considered to 
have a schizoid personality structure. The paranoids had a statistically very sig- 
nificant preference for each other while the schizoids chose each other only to a de- 
gree expected by chance. The behavior of the paranoids was interpreted as lending 
support to the hypothesis while the behavior of the schizoids was felt to be a serious 
challenge. It could not be determined whether the unpredicted behavior of the 
schizoids was due to complete emotional withdrawal or to relating to others in a 
manner independent of one’s perception of self in others. 

SELF AND PEER PERSONALITY RATINGS OF
PSYCHOTHERAPISTS


ELI A. RUBINSTEIN AND MAURICE LORR* 


Veterans Administration and Catholic University of America 


PROBLEM 


Studies devoted to objective measurement of the personality characteristics of
persons in various professions dealing with mental illness have been relatively few.
And within this area no work seems to have been done in providing any comparative
personality descriptions of psychiatrists, clinical psychologists, and psychiatric
social workers to discover any distinctive group characteristics. In view of the present
acceptance of the “team” approach to psychiatric treatment, especially in outpatient
clinics, it would also be useful to know if any significant group differences in
personality exist among these three disciplines.

As part of a larger research on evaluation of change during psychotherapy we 
attempted to obtain some information on the personality characteristics of ther- 
apists and how those characteristics may be related to patient changes in treatment. 
The various clinics collaborating in the major study were invited to send in person- 
ality ratings of clinic therapists, including psychiatrists, psychologists, and social 
workers who were all seeing patients for intensive treatment. The original purpose
of this phase of the research could not be carried out because not enough of these
ratings were on those particular therapists who were working with the patients in
our research study. There are, nevertheless, some interesting findings that can be
reported on some of the characteristics of therapists as revealed in these data.


PROCEDURE 


The Multidimensional Scale for Rating Psychiatric Patients (MSRPP), previously described,
was completed by 19 psychiatrists, 27 psychologists, and 21 social workers.
These were all full-time VA personnel in various outpatient clinics.¹ All functioned
as therapists in outpatient treatment at the time of this study. These 67 persons 
rated themselves on the rating scale. At the same time, two peers of the same or 
other disciplines who were well acquainted with the subjects also independently 
rated them on the MSRPP. In some cases a rater described more than one colleague. 
In other instances, the person rated also acted as a rater. The total number of in- 
dividuals involved was 86. All ratings were kept anonymous and were coded to 
allow for matching of self and peer ratings and to designate the discipline of the 
ratee and the rater. 

The analysis of the ratings was in terms of factors previously isolated. Six of
these factors are essentially descriptive of dynamic personality characteristics while 
four are mainly in terms of symptoms. The analysis of the therapists’ ratings was 
restricted to these six dynamic factors. A factor score consisted of the mean rating 
on the group of items which best defined a factor. In all, 25 items were included in 
the rating. 


RESULTS AND DISCUSSION 


Table 1 shows the mean factor scores by discipline and by factor for all subjects 
and for both self and peer ratings. The possible range of scores is from 1 to 6, with 
3.5 a mid-point score. In no case is there a significant difference, in terms of the 
usual “t” test, between mean ratings for the three disciplines on any single factor.

*From the Veterans Benefits Office, Washington, D. C. The authors gratefully acknowledge the
assistance of Elizabeth Turk with the statistical computations.

¹We are indebted to the VA Mental Hygiene Clinics and the respective clinic staffs in the following
cities for cooperating in the collection of this data: Baltimore, Brooklyn, Chicago, Denver, Newark,
Pittsburgh, St. Louis, St. Paul, San Francisco, and Washington, D. C.

TABLE 1. MEAN FACTOR SCORES OF THERAPISTS

Ratings | Factors
Self: Psychologists, Psychiatrists, Soc. Workers
Peer (1): Psychologists, Psychiatrists, Soc. Workers
Peer (2): Psychologists, Psychiatrists, Soc. Workers

[Cell entries are not legible in the source.]

















Nor is there in any case a significant difference between self and peer rating for a
particular discipline on a particular factor. Briefly, the six factors may be described
as follows:

Factor A involves emotional responsiveness. At the high end, in terms of our scoring, is the
emotionally restrained person who rarely expresses his feelings overtly. At the other extreme is
the uninhibited individual who easily expresses his feelings.

Factor B measures suspicious rebelliousness and a lack of concern for others versus trusting
acceptance of authority plus much concern for the welfare of others. A high score indicates the
former end of the continuum.

Factor C defines the degree of manifest tension and irritability. The higher the score, the
greater the tension.

Factor D is concerned with the sense of personal adequacy. At the high score end the individual
has a weak belief in himself, gives way to others and usually feels inadequate. At the
other extreme there is a very strong feeling of personal adequacy.


Factor E is descriptive of motivation for long term goals. A high score is indicative of high 
standards, much drive and high self-reliance. A low score describes an individual with low 
standards, little motivation for achievement and considerable dependence on others. 


Factor F measures conscientiousness, orderliness, and acceptance of responsibility. A high 
score implies the presence of these. A low score indicates the contrasting grouping. 


In general, if we interpret 3.5 as a mid-point score, it would seem that these 
therapists are neither emotionally restrained nor unrestrained, as indicated by Factor 
A scores, and are somewhat more friendly than hostile with authority figures, as 
indicated by Factor B scores. At the same time, on Factor C they are rated as having 
considerably less than average tension, instability, and anxiety. Factor D indicates
a somewhat average sense of personal adequacy. On Factors E and F, therapists 
show considerably greater than average motivation for long term goals and also much 
conscientiousness and concern for order. 

This description of the typical therapist is relative to the norms incorporated 
into the various scales. It would be of interest to compare these averages to a pre- 
sumed normal population of similar age, education, and socio-economic status. 
Although normal population ratings are not available, comparative data are available 
on a sample of 176 neurotic patients. A comparison of these ratings with a pooled 
score of the two peer ratings shows some striking differences as given in Table 2. It 
will be noted that all “t” values except one are significant beyond the .001 level, with
the greatest differences occurring in Factors B, C, D, and E. It would seem that the 
patients are rated as much more anxiously tense, with much more hostile rebellious- 
ness, lower sense of personal adequacy, and with much less motivation for long-term 
goals than the therapists who work with these patients. 

The reliability of these therapists’ ratings is of obvious pertinence. It is also 
pertinent to note that inter-judge agreement, involving complex clinical judgments, 

TABLE 2. DIFFERENCES BETWEEN MEAN FACTOR SCORES OF THERAPISTS AND
PATIENTS

Factor | Therapist | Patient

[One column of six entries survives without a legible heading: 3.57, 3.24, 2.66, 3.38, 4.10, 4.32.]

** significant beyond the .001 level
* significant beyond the .05 level



is often considerably lower and more variable than other measures of reliability. 
Hunt has pointed out some of the problems involved in obtaining a suitable
measure of reliability in such instances. 

As one approach to the question of reliability we obtained, for each therapist, the
product-moment correlation across the 25 items in the rating scale between the
self-ratings and an average of the two peer ratings. This pooling was done in an attempt
to obtain a more stable measure of peer ratings to compare with self-rating. Using a z
transformation we obtained an average correlation of .66 for the resulting 67 correlations.
The range of these correlations is fairly wide, with three correlations just below .20.
However, 55 of the 67 correlations range from .53 to .87 (σ_r = .204 when r = 0)
which, for inter-judge agreement, is in keeping with findings such as those in Hunt.
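
The averaging step can be illustrated with a brief sketch. It assumes that the z transformation referred to is Fisher's z = arctanh(r), with the mean taken on the z scale and converted back to r, and that the standard error of r under a true correlation of zero is taken as 1/√(N − 1). The function names and the example coefficients are hypothetical and are not the study's data.

```python
from math import atanh, sqrt, tanh


def average_r(correlations):
    """Average correlations on Fisher's z scale, then transform back to r."""
    zs = [atanh(r) for r in correlations]
    return tanh(sum(zs) / len(zs))


def se_r_null(n):
    """Standard error of r when the true correlation is zero: 1 / sqrt(n - 1)."""
    return 1 / sqrt(n - 1)


# Hypothetical coefficients, for illustration only.
example = [0.20, 0.53, 0.66, 0.87]
print(round(average_r(example), 2))
print(round(se_r_null(25), 3))  # 0.204 for correlations based on 25 items
```

With 25 items per correlation this standard error agrees with the .204 quoted in the text. Averaging on the z scale gives more weight to the larger coefficients than a simple arithmetic mean of the r values would.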

These latter correlations might have been somewhat higher if it were not for 
the restricted range of the ratings. These were all six point scales and the range was 
even more restricted by virtue of the fact that the scales had originally been devised 
for rating neuropsychiatric patients. The patients would have more extreme person- 
ality characteristics than the therapists and the range of scores for this latter group 
would consequently be narrower. As an indication of this difference it should be 
noted that the standard deviations of the factor scores in Table 1 average about .5 
of a rating interval. The comparable standard deviations of ratings for the patient 
sample in Table 2 are just about twice as large. 

At the same time, there is another and opposing influence which may tend to 
raise these correlations. To what extent are the ratings providing a personality des- 
cription with general applicability? In other words, is there a basic stereotype which 
partially describes all the therapists? Since these are all persons in a trio of highly 
related professions it may well be that individuals in these professions have consider- 
able personality similarities. Furthermore, as was pointed out above, these scales 
were designed to describe a patient population. To the extent that a sample from 
another population did not demonstrate any of the extreme characteristics of the 
patient population there would be some agreement of rating. As one approach to 
this question we took 15 self ratings at random, 5 psychiatrists, 5 social workers, and 
5 psychologists each and obtained 15 product-moment correlations by comparing 
each member of each group of 5 with the other two groups respectively. The cor- 
relations were all positive, with a range from .25 to .68, with 5 of the 15 coefficients 
above .50. Again using a z transformation we obtained an average correlation of 
.48 for the 15 correlations, which is greater than twice the standard error for a single 
such correlation. Our data would thus seem to support the conclusion that there is 
some general similarity among all the persons rated. Or to be more precise, these 
therapists tend to describe themselves similarly. 

While this latter finding should be taken into account in any interpretation of a 
coefficient of reliability, it does not necessarily decrease the stability or meaningful- 
ness of the mean scores given in Tables 1 and 2. Considering the nature of the 
data, it seems quite likely that the mean factor scores are fairly stable and the differ- 
ence between patient scores and therapist scores indicates a significant difference 
between the two groups. Furthermore, the very similar ratings, both self and peer, 
of psychotherapists in the three disciplines have implications for the team approach
which would warrant further exploration. 


SUMMARY 


Self and peer personality ratings on the MSRPP were obtained on a group of 67 
psychotherapists in VA mental hygiene clinics. These included psychiatrists, 
psychologists, and social workers. Analysis in terms of six previously identified 
factors showed no significant differences among the three disciplines on any of the 
factors. At the same time, there were significant differences between these therapists 
and a sample of patients who had also been rated on all the factors. Therapists were 
rated as less tense and anxious than the patients, with less hostile rebelliousness, a 
higher sense of personal adequacy, and more motivation for long term goals. 

THE SCATTER PATTERN OF A SOUTHERN NEGRO GROUP ON THE
WECHSLER-BELLEVUE INTELLIGENCE SCALE


JULIAN C. DAVIS 
Florida State Hospital, Chattahoochee, Florida 


INTRODUCTION 

When so-called comparable groups of whites and Negroes are tested, psychometric
results of the Negro group almost invariably are inferior. In our own
experience it has been found that over half of all Negroes examined at the Florida 
State Hospital scored within the defective range on standard individual intelligence 
tests. Despite the group being partially composed of mental patients, the high in- 
cidence of deficiency is obviously a psychometric artifact as substantiated through 
work histories and clinical observation of the individuals. This is not unusual when 
it is considered that the intelligence scales in current use were standardized on essen- 
tially non-Negro populations, a circumstance which leaves us in the lamentable pre- 
dicament of having no instrument available for the accurate assessment of the 
Southern Negro. The need for such an instrument has led to the present preliminary 
investigation. 

Even though the position has been challenged, most investigators are in
agreement that “an individual of a given socio-economic status has a particular
cultural experience, training, and motivation, which have developed certain areas of
his mental behavior and certain skills more than others”. Through studying a
sufficient number of Wechsler protocols of Southern Negroes we hope to learn which
areas and skills have developed most. Not only will this information make possible
a more accurate interpretation of the findings, which Wechsler¹ says need special


¹Wechsler, D. Personal communication, January, 1949.
interpretation, but should furnish valuable information concerning a choice of problems
that are equally common and equally motivating for this culture group in the
construction of a new test. 

The basic philosophy underlying this study is stated in the observation of 
Finner² that “We are possibly unscientific in attempting to measure the Negro by a
yardstick originally constructed for whites. Eventually it seems that we will need 
to develop a scale that will permit the unique qualities of the Negro to manifest them- 
selves if there are such unique qualities. For the time being it will not matter where 
the beginning is made. The need is for some objective work in the field.” 


PROCEDURE 


The control group for this study is composed of twenty-seven Negro hospital 
employees, both men and women. All were given the Full Scale, Wechsler-Bellevue,
Form II. The age range was from 18 to 60 years with the median being 22 years. 
Nine subjects were under twenty, and six over thirty, with only one being over forty. 

The patient sample is composed of thirty-three Negro mental patients, men and 
women, chosen at random from the patient files of the Florida State Hospital ir- 
respective of diagnosis. The only selective factor was that the Full Scale of the 
Wechsler-Bellevue, Form II had to be completed. Age range of the patient group 
was 14 to 59 with the median age of 35. Four patients were under twenty and six 
over forty. 

The education of both groups ranged from no school years completed through 
one year of college. The number of years of school completed for the study groups 
compares favorably with that of nonwhites 25 years old and over in the State of 
Florida, as of 1950. Table 1 gives the comparative figures.


TABLE 1. COMPARATIVE YEARS OF SCHOOL COMPLETED BY TEST GROUPS AND NONWHITE PERSONS
25 YEARS OLD AND OVER FOR THE STATE OF FLORIDA, 1950.

Years of school completed
Groups | None | Elem. school 1-4 | Elem. school 5-8 | High school 1-3 | High school 4 | College 1-3 | College 4 or more | Median school years
Nonwhite Population | 8% | 34% | 40% | 10% | 5% | 2% | 2% |
Patients (N = 33) | 9% | 36% | 42% | 6% | | | |
Employees (N = 27) | 7% | 33% | 33% | 15% | 7% | 4% | |


















RESULTS 


The IQ’s of the control group ranged from a low of 51, earned by a twenty-one 
year old man with a third grade education, to a high of 99, achieved by a thirty-three 
year old man with one year of college. (Incidentally, the man with the latter score 
is obviously of superior intelligence despite his “average” IQ. He served as a staff
sergeant in the Army and was an excellent attendant on the receiving service at the 
State Hospital.) The mean IQ for the group is 68. The IQ range of the patient group 
is from a low of 45 earned by a man with no formal schooling to a high of 108 by a 
man with one year of college. The mean IQ for the group is 67. 

More frequently than not verbal ability exceeds performance ability. There is a 
wider variability of sub-test scores on the verbal scale, and at the higher levels per- 
formance scores are sometimes above the verbal scores. However, discrepancies 
large enough to be significant are usually found to be in favor of verbal ability. The 
test scatter follows the same general pattern in both groups. This pattern can be 
seen most clearly by an examination of the group mean weighted scores in Table 2. 


*Finner, P. F. Personal communication January, 1949. 
TABLE 2. MEAN WEIGHTED SCORES OF SOUTHERN NEGRO GROUPS ON THE
WECHSLER-BELLEVUE FORM II.





Sub-tests Employees (N = 27) Patients (N = 33) 


Information 
Comprehension 
Digit Span 
Arithmetic 
Similarities 
Vocabulary 

Picture Arrangement 
Picture Completion 
Block Design 
Object Assembly 
Digit Symbol 
DISCUSSION

Digit Span and Digit Symbol, along with Similarities, proved most difficult for
our sample. Digit Span correlates very poorly with all other tests of general
intelligence, and Digit Symbol handicaps individuals not practiced in the use of paper
and pencil. Wechsler (10) found Digit Span to correlate best with Digit Symbol and
least with Comprehension, a test upon which the sample did well. The implication is
that Southern Negro scores are depressed most by tests that are not measures of
general intelligence for this group.

The low score on Similarities shows that the group has difficulty in differentiating
between essential and superficial likenesses and suggests immaturity of thought.
Environmental factors might well account for this lack of maturity, and no attempt
is made to conclude that race per se (at least as represented by the Southern Negro)
is a limiting factor in intelligence. In fact, additional supportive evidence of the
influence of environmental factors can be drawn from the higher scores obtained on
Comprehension and Vocabulary than on Information and Arithmetic. The latter
scores are influenced to a greater degree by schooling.


SUMMARY AND CONCLUSIONS

This is a preliminary report of the performance of two Southern Negro groups,
33 mental patients and 27 hospital employees, on the Wechsler-Bellevue Form II.
The sample is too small to admit definite conclusions, but the results suggest a
distinct sub-test scatter pattern for the Southern Negro with a probable relationship
between environmental factors and sub-test difficulty. There is the added
implication that Southern Negro scores may be depressed most by tests which are not
measures of general intelligence for this group.


REFERENCES 

1. Altus, W. D. A note on group differences in intelligence and the type of test employed. J.
consult. Psychol., 1948, 3, 194-195.

2. Bullock, H. A. A comparison of the academic achievement of white and Negro high school
graduates. J. educ. Res., 1950, 44, 179-192.

3. Davis, W. A. and Havighurst, R. J. The measurement of mental symptoms. Sci. Mon., 1948,
66, 301-316.

4. Hammer, E. F. Comparison of the performance of Negro children and adolescents on two tests
of intelligence, one an emergency scale. J. genet. Psychol., 1954, 84, 85-93.

5. Jenkins, M. D. The upper limit of ability among American Negroes. Sci. Mon., 1948, 5, 399-401.

6. Jenkins, M. D. and Randall, C. M. Differential characteristics of superior and unselected
Negro college students. J. soc. Psychol., 1948, 27, 187-202.

7. Lewis, H. Innovations and trends in the contemporary Southern Negro. J. soc. Issues, 1954,
10 (1), 19-27.

8. McGurk, F. On white and Negro test performance and socio-economic factors. J. abnorm. soc.
Psychol., 1953, 48, 448-450.

9. McGurk, F. Socio-economic status and culturally-weighted test scores on Negro subjects.
J. appl. Psychol., 1953, 37, 276-277.

10. Wechsler, D. The measurement of adult intelligence, third edition. Baltimore: Williams &
Wilkins, 1944.

11. Young, F. M. and Collins, J. J. Results of testing Negro contact syphilitics with W-B
intelligence scale. J. soc. Psychol., 1954, 39, 93-95.

12. Detailed characteristics Florida, 1950 population census report P-C10. Washington: U.S. Gov’t
Print. Off., 1952.





THE UTILIZATION OF A PSYCHOLOGIST IN WARD
ADMINISTRATION¹


WARREN W. WEBB AND HIRAM L. GORDON 
Vanderbilt University V. A. Hospital, Fort Lyon, Colorado 


INTRODUCTION 

Traditional professional roles become firmly established with the passage of
time and then are resistant to modification. Under unique and pressing local
conditions a reappraisal of job requirements and professional skills can bring about
changes in professional roles. Such was the case locally. Because the Director of
Professional Services² wished to provide knowledgeable and interested patient
planning despite a growing shortage of psychiatrists, the psychology service was
asked to assign a psychologist to play a leading role as a member of
a ward team under the general supervision of an interested psychiatrist. After the
initial resistance by psychologists and psychiatrists had been worked through, an
assignment was made. It was a two-year assignment and was to be reviewed upon
completion of an experimental evaluation. This paper reports the results of that
evaluation.

The goal of the project was the broad but usual one of structuring a total situation
within which the individual patient could move in the direction of greater hospital
freedom, return to the community, and remission. The elements of such a
structured situation which were striven for (although not always attained) were (a)
closer cooperation and fuller communication among the supervising psychiatrist,
psychologist, social worker, nurses, aides, physical medicine therapists, and special
services personnel; (b) increased staff-patient interaction through interviews,
individual and group psychotherapy, and coordinated work and recreational activities;
(c) increased contact with families, including a family discussion group organized
by the social worker; and (d) more individualized patient planning. The inevitable
interpersonal conflicts under these conditions of shifting roles and work emphases
did occur but will not be recounted here since we have no general conclusions as to
their resolution. It was our experience that they can only be worked out uniquely
by the persons involved.


PROCEDURE AND RESULTS 

Subjects were 132 patients (E) on the experimental ward and 40 control
patients (C) on five other hospital wards. All the patients were male, with a diagnosis
of schizophrenia, and were without known organic involvement. Pertinent data for
comparison purposes are presented in Table 1.

The variables expected to show the consequences of increased therapeutic
effort were as follows: pre- and post-morbidity scores on the Multidimensional
Scale for Rating Psychiatric Patients, mean proportion of project time on ground
privileges, mean proportion of project time on leave of absence, and mean proportion
of project time out of the hospital (trial visit and discharge status). These results are


TABLE 1. PRE-PROJECT DATA

                                      E (N = 132)     C (N = 40)
Factors                               Mean    SD      Mean    SD

Age                                   32. 5.
Education                             3.
Years since first hospitalization     3
Morbidity score                       1

















¹This study was carried out while both authors were on the staff of V. A. Hospital, Roanoke,
Virginia. The authors wish to express their gratitude to Drs. C. W. Grady, H. P. Morgan, J. A.
Newcome, G. I. Meyer, B. M. Smith, B. J. Fitzgerald, Miss Dorothy McGriff, and to a host of others
too numerous to name for making this study possible.

²Dr. H. P. Morgan, now Manager, V. A. Hospital, Fort Lyon, Colorado.
shown in Table 2. (For administrative reasons E group lost ten patients to other
wards and C group lost one.)


Variable                              E (N = 122)   C (N = 39)     t       p

Mean morbidity score decrease             2.5           1.2        -       -

Mean proportion of project time
  On privileges                          .431          .421
  On leave of absence                    .012          .008
  Out of hospital                        .127          .029      2.18    <.05

All the measures show trends in the expected direction with time out of the 
hospital attaining statistical significance. Since the mean morbidity scores³ do not
show a significant drop for E group as compared with C group it cannot be said that 
basic psychotic processes were generally alleviated by the therapeutic program. Nor 
can it be said from the privileges and leaves of absence data that patient freedom was 
substantially increased during the project period. The time out of the hospital data, 
however, reveal noteworthy success of the planning program in aiding patients to 
move back into the community. 

Two fortuitous happenings should be mentioned. The tranquilizing drugs 
came into widespread use after the inception of this project and could not be experi- 
mentally controlled. Of E group patients, 17 percent received reserpine and 27 per- 
cent received chlorpromazine. With C group patients the percentages were 15 and 
67, respectively. The results of the project, then, cannot be attributed to the use of 
tranquilizing drugs. The second happening was an outbreak of tuberculosis in the 
E group ward during which 21 patients (16 percent) were transferred to TB hospitals 
and hence were no longer able to benefit from the therapeutic program. Furthermore,
the functioning of the entire therapeutic program was hampered by this
outbreak.








SUMMARY AND CONCLUSIONS 


The professional skills of a psychologist were called upon in a non-traditional 
assignment to play a leading role in ward administration under the general super- 
vision of a psychiatrist. The total effort of the psychologist and the other members 
of the ward team was compared with that of other ward teams (without a psycho- 
logist in an administrative capacity) in terms of the usual goal of assisting patients 
toward greater hospital freedom, return to the community, and remission. The 
variables expected to show the consequences of increased therapeutic effort (ratings 
of morbidity, time on ground privileges, time on leaves of absence, and time out of 
the hospital) showed trends in favor of the special project ward. On one of these 
variables (time out of the hospital) the difference between the project and control 
wards was statistically significant. It was concluded that the professional skills of 
a psychologist can be of definite value in coordinating the total efforts of ward team 
members in patient planning so as to move the chronic schizophrenic patient back 
toward the community. 
³Since the raters may have had at least incidental acquaintance with the patients on the project
ward, the possibility of bias was considered. A new staff psychologist, who had no information as to 
the identity of the patients, interviewed and rated 10 E patients and 10 C patients. Comparison of 
his ratings with those of the regular raters failed to reveal any trend toward bias. 





EVALUATION OF MODIFIED ADMINISTRATION OF THE CALIFORNIA 
PSYCHOLOGICAL INVENTORY 


LAWRENCE A. BENNETT AND ALVIN RUDOFF 


Deuel Vocational Institution, Tracy, California 


PROBLEM 


There is a growing interest in and use of the California Psychological Inventory as
both a diagnostic and research tool. Since a representative sample is desired in
standardization projects, many subjects with limited reading ability are included.
This focuses attention on the problem of the effect of limited reading ability on the
results obtained with any paper-and-pencil test attempting to assess personality.

It is assumed that the C. P. I. requires a reading ability of about sixth grade
level since it is very similar in item content to the M. M. P. I., whose readability has
been estimated at this level. In many situations we attempt to assess individual
personality changes of a group that includes many who fall below this level. We
assume on the basis of numerous previous studies and accepted educational theory
that the speaking or aural recognition vocabulary is substantially higher than the
reading vocabulary of the group. In this event, it would follow that reading the
items aloud to the group would make for better, more accurate and reliable responses.
However, experience in the preliminary administration of the C. P. I. in this manner
to undisciplined groups (e.g., juvenile delinquents) indicated the possibility of
adverse influence on individual responses of spontaneous group reactions such as
laughter or side comments. The question then becomes, are the adverse effects of
group reaction sufficient to overcome the advantages that would obtain as the result
of reading the test to the group? Our concern, then, is to evaluate the effect of the
modification of administration on the results: are the results consistent with those
obtained from the regular administration or, if they vary, to what extent?


METHOD 


A group of inmates at Deuel Vocational Institution, Tracy, California, was used 
as subjects. A reading ability of at least seventh grade, as measured by the Cali- 
fornia Achievement Test, was required as the only criterion of selection. This insured 
that all subjects would be able to read the items fairly adequately. The first 200 in- 
mates so selected during the regular guidance processing were used in the sample. 
Half of the group read the test items and then, within a three day period, had the 
items read to them. In the case of the other half, the procedure was reversed. 


RESULTS 


With the sequence effects controlled by the design, the results from the two
groups, called “Reading” and “Read To”, were correlated. On each of the subtests
the correlations were substantially greater than test-retest reliability coefficients
reported by Gough, with the single exception of the “Fl” scale (Flexibility). The
low correlation might well be related to the small number of items (twenty-two), for
when the means of the two groups are compared statistically, no significant difference
is found (see Table 1).

However, most clinicians are more interested in individual profiles than in
group tendencies. To approach this problem, the objective analysis of a complete
profile, the coefficient of profile similarity was applied. This included modifications
to handle segments of a profile of zero slope and an index independent of the
order of the subscales. Since the C. P. I. comprises 18 subscales, a total of 153
slope comparisons were made on each subject. Of the 200 profiles compared, 95.5%
were found to be statistically similar at the 1% level of confidence. Only 3% fell
outside the 5% level of confidence for significant similarity and could thus be
considered as being not similar.
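
The figures in this paragraph can be checked with a little arithmetic. The sketch below assumes that one slope comparison is made for every unordered pair of the 18 subscales, which is an inference from the order-independent index and the reported total of 153, not a statement of the authors' computing procedure. The variable names are illustrative only.

```python
from math import comb

N_SUBSCALES = 18   # C. P. I. subscales, as stated in the text
N_PROFILES = 200   # subjects whose two profiles were compared

# Assumption: one slope comparison per unordered pair of subscales.
slope_comparisons = comb(N_SUBSCALES, 2)

# Profile counts implied by the reported percentages.
similar_at_1_percent = round(0.955 * N_PROFILES)
not_similar = round(0.03 * N_PROFILES)
similar_at_5_percent = N_PROFILES - not_similar

print(slope_comparisons, similar_at_1_percent, not_similar, similar_at_5_percent)
```

Under this assumption the pair count reproduces the 153 comparisons reported, and the profiles not rejected at the 5% level amount to 97% of the sample.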







TABLE 1. COMPARISON OF COEFFICIENTS OF CORRELATION BETWEEN CURRENT
TEST-RETEST AND RELIABILITY MEASURES REPORTED BY GOUGH

                      D. V. I. Inmates     High School Juniors
                      (N = 200)            (N = 101)

*Fl Scale                      Reading    Read-To
 Raw Score Mean                8.575      8.515
 Sigma                         3.41       3.41
 Difference Between Means           .06
 t                                  .25 (not significant)





SUMMARY AND CONCLUSIONS 


Half of a group of 200 subjects were administered the California Psychological 
Inventory in the usual manner of having the subject read the items to himself. Then, 
within a three day period, the inventory was administered by reading the items aloud 
to the subjects. The other half of the group went through the two forms of the 
administration, but in the reverse order. Inasmuch as the test-retest coefficients of 
correlation using the two methods of administration compared favorably with the 
test-retest measures of reliability using the same administration throughout, it is 
concluded that, as far as groups are concerned, no appreciable difference in result 
occurs due to the modification of administration. 

Furthermore, as the profiles resulting from the two methods are statistically 
similar in 97% of the cases, it can be concluded that modification of administration 
has but an insignificant effect on individual profiles. Thus, where low reading ability 
is a problem, the modified administrative procedure of reading the items to the sub- 
jects can be employed without seriously distorting either group trends or individual 
profiles. This modified procedure has application not only in research studies, but 
also in clinical screening. 
SEASONAL CHANGES IN ADMISSION RATES
OF A STATE HOSPITAL¹


PAUL A. HAUCK 
East Moline State Hospital, Illinois 


INTRODUCTION 


Adverse economic, social and political conditions can have a direct effect upon 
mental status, and as a result, psychiatric referrals to clinics and admission rates to 
mental hospitals seem to follow a pattern with such regularity as to make future ad- 
mission and referral rates somewhat predictable. However, other patterns seem to be 
constant even in times when the economic, social, or political conditions are in a state 
of relative rest. There seem to exist seasonal fluctuations in the rate of patient ad- 
missions which have been commonly observed, but as yet only impressionistic evi- 
dence indicates which seasons seem to favor high or low admission rates to state 
hospitals. Work in this area has dealt mainly with noting the incidence of mental 
disease in particular states, or trends as they appear over a period of years, but the 
question of whether a consistent pattern emerges from the monthly admission rate to 
state hospitals has remained largely unexplored. The present study undertakes to 
quantify the admission rates for the East Moline State Hospital on a monthly basis 
for the years 1953, 1954, and 1955. The population was broken down by sex and into
12 diagnostic groups as recommended in the 1952 Diagnostic and Statistical Manual 
on Mental Disorders. 


RESULTS 


The total admissions for 1953 to 1955 are presented in Table 1. In 1953 the 
highest admission rate was in November with 63 and the lowest in October with 46. 
December takes a sharp drop after the November rise which is then followed by a 
sharp rise during January. In February, the rate again drops. A careful perusal of 
each following year reveals that this basic relationship between these five months 
holds. Taken as a group the six months from September to February contribute 
about an equal share of the year’s admissions, when compared to the remaining six
months. But among themselves a pattern appears which remains year after year and 
which holds also when the three years are totaled up. Within this six month period, 
small fluctuations from year to year cause October and December to change position 
for the low-rate month, and November and January to change position for the high- 
rate month. Aside from these slight changes it appears that the stability of this 
pattern is being determined by other than chance factors. 
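
The five-month relationship described above (October low, November rise, December drop, January rise, February drop) can be stated as a simple check on monthly counts. The sketch below is illustrative only: the function name is hypothetical, and only the October (46) and November (63) figures for 1953 come from the text; the December, January, and February values are invented placeholders, not data from the study.

```python
def follows_pattern(counts):
    """Return True when Oct < Nov > Dec < Jan > Feb for one autumn-winter run."""
    return (counts["Oct"] < counts["Nov"] > counts["Dec"]
            and counts["Dec"] < counts["Jan"] > counts["Feb"])

# Oct and Nov are the 1953 figures quoted in the text.
# Dec, Jan, and Feb are hypothetical placeholders chosen only for illustration.
example = {"Oct": 46, "Nov": 63, "Dec": 50, "Jan": 60, "Feb": 52}

# A flat series, for contrast, does not show the pattern.
flat = {"Oct": 50, "Nov": 50, "Dec": 50, "Jan": 50, "Feb": 50}

print(follows_pattern(example), follows_pattern(flat))
```

Applying such a check to each year separately, and to the three-year totals, is one way of making the claimed stability of the pattern explicit.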

In order to determine if this monthly pattern was purely a local phenomenon 
for the East Moline State Hospital the admission rates of three other state hospi- 
tals for 1955 from widely differing areas and social settings were grouped together. 
These hospitals are at Peoria, Elgin, and Alton, all in Illinois. The total admission 
rates by months for the four hospitals combined are found in Table 1. 

Once again the pattern observed for the East Moline State Hospital is basically
repeated when all four hospitals are grouped as one. A slight drop from September 
to October makes October the low month for the year. This is again followed by a 
rise in November. Then a drop to a second low in December after which the rates 
soar to their highest point in January. February follows with its usual dip. 

To determine if the large difference between December and January is mainly a
function of the alcoholic populations which one could conceivably expect after the
holiday season, the entire alcoholic population was subtracted from the rates for the
four hospitals. This manipulation causes some minor changes: September is now the
lowest month and October is a close third. June has one more admission than
January, making it the high month. Again, however, the pattern shown above is still
intact. October is low followed by a November rise. December is low followed by a
January rise and the usual February dip.


¹The author is grateful to Phillip W. Wenig, Supervisor of the Statistical Research Section for the
State of Illinois, for making the data for this study available to him.


SUMMARY AND CONCLUSIONS 


The months in question each have a characteristic “social” definition. December,
for example, because it is the month during which Christmas falls, is regarded by
our culture as a time of festivity and good-will. Preparation for December 25 begins 
soon after Thanksgiving and a mood of elation and anticipation is maintained until 
January 1. There would seem then to be good reason why December has a low ad- 
mission rate year after year. 

But, what of the entire five month period? We note that October is low. It is 
the month of beautiful colors, bountiful harvests, and a return to more intimate 
family life in the cool evenings. The children are sent to school with great relief by 
the harassed mothers and they are still allowed to play outdoors after school. The
hectic summer is over and old routines are revived. When November appears, the 
cold blasts of winter with its occasional treacherous snows force the children to play 
more indoors than out. The nights are long and many families are confined to close 
living quarters which now require large and expensive amounts of fuel for warmth. 
And admission rates rise. 

In January the dead of winter is to be coped with. There is a let down when the 
holiday wreaths and decorations are packed away. Long periods of cold increase the 
fuel and clothing bills, and the prospect of paying for an abundant Christmas is 
sometimes anxiously received. Colds, coughs and other “winter” sicknesses and the
expensive medicines to cure them seem to pile up during these bleak days. License 
plates and taxes probably increase the financial burden. 

These problems are probably brought to solution in February by those who do
not come to the hospital in January. They have the prospect of looking for the com- 
ing of Spring within a month. The days are getting longer and the cold is occasion- 
ally broken by warmer spells. 

March itself is a paradox. At times the rates drop from the February level, and 
at times they go up. No explanation is offered at this time for its erratic behavior or 
for that of the other months. 


PERSONALITY CHANGES FOLLOWING TRANSORBITAL LOBOTOMY¹
NANCY B. ALGEO² AND MAXWELL S. PULLEN³
Ohio University 
INTRODUCTION 


Robinson, Freeman and Watts (1, 2) have developed two instruments for the
measurement of personality changes in lobotomy patients: the Sensibility
Questionnaire (SQ) and the Self-regarding Span test (SRS). The first purports to measure
“self-adjustability” or “susceptibility to social pressures”; the second is said to
measure “self-continuity”. Robinson administered these instruments to 68 lobotomy
cases after they had undergone psychosurgery. As controls she used “. . . 12
individuals of similar age and diagnosis who improved without operation.” The two groups
were also equated on the basis of Stanford-Binet Vocabulary scores. Robinson, et al.,
found significantly lower scores in the operated group than in the control group.
They concluded that lobotomy has the effect of reducing both “susceptibility to
social pressure” and “self-continuity”, as these terms were defined operationally
(1, 2). This experiment did not provide for an actual preoperative-postoperative
control group. They compared the postoperative scores of the lobotomized patients
to the scores of the control group that did not receive lobotomy, but it appears that
the experimental and control groups were not necessarily comparable. The very fact
that the controls did not receive psychosurgery suggests that their personalities may
have differed from those of the experimental group prior to the latter’s operation.
These differences, if there were any, could have been in the direction of less
“self-continuity” and “susceptibility”, which were the very changes that were alleged to
have occurred as a result of lobotomy. While this situation is not one which negates
the value of the quoted study, it would seem well worth while to note the results
when preoperative status of subsequent lobotomy patients is considered. This
study, then, is an attempt to repeat the main part of the experiment by Robinson,
Freeman and Watts, with modifications to effect the necessary controls.


METHOD 


Subjects. The subjects were 34 mental patients at the Athens (Ohio) State 
Hospital, who had been designated to undergo transorbital lobotomy by Dr. Walter 
Freeman. Of this group, 13 were men, 21 women. The age range was 25 to 73 with 
a mean of 38.9. Mean age of the men was 37.3, of the women 39.9. The following 
diagnostic categories were represented: 11 paranoid schizophrenic, 9 catatonic 
schizophrenic, 4 manic-depressive depressed, 3 undifferentiated schizophrenic, 2 
hebephrenic schizophrenic, 2 paranoid state, 2 involutional disorder, and 1 psycho- 
neurotic disorder. Length of education varied from no formal schooling to high 
school graduation, with a mean of 9.1 grades completed. IQ’s (mainly Wechsler- 
Bellevue, Form I) ranged from 65 to 116 with the majority falling in the dull- 
normal and normal categories. Duration of hospitalization ranged from two months 
to eight and one-half years. Mean length of hospitalization was 22.9 months. Onset 
of the mental disorder was gradual in 29 cases and acute in the remaining five. 


Tests. The Robinson-Freeman Self-regarding Span test (SRS) was used to
measure “self-continuity”. The term “self-regarding span” is thus defined
operationally as the number of seconds that an individual can spend thinking and talking
about himself as a person. To measure “susceptibility” the Robinson-Freeman
Sensibility Questionnaire (SQ) was used. This test is designed to measure the
individual’s susceptibility to emotional experience in two areas: his concern for his
own past and future, and his concern for the opinions of other people.


¹The authors are indebted to the administration and staff of the Athens State Hospital, whose
cooperation and assistance made this study possible.

²Now at Washington, Pennsylvania.

³This study is based on a portion of the first author’s MA thesis. The second author, who directed
the thesis, is responsible for the preparation of this report.


Procedure. The 34 subjects were divided randomly into two groups, designated
I and II. An empirical check, after the division was effected, revealed that the groups
were virtually equivalent with respect to the control variables of sex, age, education,
diagnostic category, duration of illness, and type of onset. Group I was given the
SRS preoperatively and the SQ postoperatively; Group II was given the SQ
preoperatively and the SRS postoperatively. The pretests were administered from two
to four days prior to the operation and the posttests from three to six days after the
operation. The test-retest interval ranged from six to eight days, and the means of
the intervals of the two groups did not differ significantly. Pre-postoperative
differences were computed for the SRS (Group I preoperative scores minus Group II
postoperative scores), and for the SQ (Group II preoperative scores minus Group I
postoperative scores). In other words, the total group acted as its own preoperative-
postoperative control. This procedure was based on the assumption of comparability
referred to above. The differences between group means were tested for significance
by means of the standard t test. Preoperative-postoperative differences were also
computed separately for males and females, and for long vs. short hospitalization
categories.
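
Because each test was given preoperatively to one group and postoperatively to the other, every comparison is between two independent sets of scores. A minimal sketch of such a comparison follows. It assumes that the "standard t test" is Student's t with pooled variance, which the text does not specify, and the scores shown are hypothetical values invented to exercise the function, not data from the study.

```python
from math import sqrt
from statistics import mean, variance

def pooled_t(pre, post):
    """Student's t for two independent groups, using a pooled variance estimate."""
    n1, n2 = len(pre), len(post)
    pooled_var = ((n1 - 1) * variance(pre) + (n2 - 1) * variance(post)) / (n1 + n2 - 2)
    standard_error = sqrt(pooled_var * (1 / n1 + 1 / n2))
    degrees_of_freedom = n1 + n2 - 2
    return (mean(pre) - mean(post)) / standard_error, degrees_of_freedom

# Hypothetical SRS scores: Group I tested before, Group II tested after the operation.
group_1_pre_srs = [9, 7, 8, 6, 10]
group_2_post_srs = [3, 2, 4, 1, 5]

t_value, df = pooled_t(group_1_pre_srs, group_2_post_srs)
print(t_value, df)
```

The same function would serve for the SQ comparison (Group II preoperative against Group I postoperative) and for the separate analyses by sex and by length of hospitalization.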


RESULTS 

The reduction in mean scores on the SQ was 5.47 (from 6.18 to 0.71), a
difference significant at the .05 level of confidence. The mean score reduction on the SRS
was 4.49 (from 7.12 to 2.63), a difference significant at the .01 level. Thus, while our
results are not as marked as Robinson’s (owing probably to the smaller number of
cases), the present study does confirm her conclusion: that lobotomy tends to reduce
“susceptibility” and “self-continuity” scores.

Two subsidiary analyses of the data yielded the following results: (1) The
pre-post differences on both tests were significant for the males as a group (.01 level); for
the females as a group neither instrument yielded a significant change. (2) With
regard to length of hospitalization there was a significant (.05 level) reduction in scores
on the SRS for those hospitalized less than 1½ years; this group did not show a
significant reduction on the SQ. For the sub-group hospitalized over 1½ years neither
test yielded a significant change.


SUMMARY 


In an effort to repeat Robinson and Freeman’s experiment under more
controlled conditions, the SRS Test and the SQ were administered to a group of
lobotomy patients before and after the operation. Significant reductions in both
self-adjustability and self-continuity were found to occur following lobotomy, thus
confirming the evidence obtained by Robinson and Freeman.
EDITORIAL OPINION





SELF-CONSISTENCY IN SCIENTIFICALLY-ORIENTED 
CLINICAL PRACTICE 


After listening to lectures dealing with the validity of many techniques currently 
used in clinical psychology, many students report feelings of bewilderment and frus- 
tration upon discovering that many popular techniques are either unvalidated or 
actually invalid. We often have had students comment: ‘You have not left us much 
to work with. Is it not better to use unvalidated methods than to be completely 
scientific and have to admit that we don’t know anything at all about many prob- 
lems?” Such a dilemma occurs, for example, with reference to psychoanalytic inter- 
pretations of projective test results. In some installations, clinical psychologists 
routinely are expected to formulate interpretive reports which go far beyond any- 
thing which is scientifically valid. Some of these reports soar to speculative heights 
which would do credit to necromancers. The situation can become pretty embarrass- 
ing when such reports are analysed sentence by sentence in order to discover just 
what of proven validity actually exists to justify such interpretations. The student 
who attempts to be scientifically critical may find himself in a very delicate position 
if he attempts to question the validity of what he is being taught. Indeed, consider- 
able prestige may fallaciously accrue to cult members who may achieve transient 
success by virtue of their ingenuity in parroting popular credos. 

There is involved here the extremely important issue of whether the clinician is 
to rigorously adhere to scientific principles at whatever the cost at all times or only 
when it is convenient. There can be no compromise with thoroughgoing scientific 
criticality because any lapse of scientific methodology tends to invalidate the whole 
process. An apt analogy exists in the sterile techniques of modern surgery, i.e., the
whole objective is lost if sterile technique is broken at any point. The clinician is in 
a particularly difficult position because methods and interpretations of proven valid- 
ity are so scarce in relation to the tremendous demands placed upon us by society. 
The best which most clinicians can do is to clearly indicate the point at which exact 
science leaves off and clinical intuition takes up. The young clinician must not be 
ashamed to say that he doesn’t know. At least we can gain the negative merit of 
pointing out what is not true even though we can’t as yet make positive statements 
as to what is valid. 


We believe that this is the issue over which many experimentalists reach a part-
ing of the ways with their clinical brethren. The psychologist with rigorous experi-
mental-statistical training is perhaps one of the most self-critical people alive because 
his training causes him to scrutinize every conclusion critically and with reflex 
automaticity. Nothing is left unsupported or unchallenged. Is it any wonder that the
experimentalist expresses his perplexity and tends to reject with righteous in-
dignation many of the methods and conclusions reached by clinicians in their every- 
day practice? Let us spend no time rationalizing many of these practices because 
they are scientifically indefensible. This writer, for example, has been unable to 
utilize many projective methods and psychoanalytic interpretations because earlier 
experimental-statistical training has made it simply impossible to assume a sort of 
split personality, adhering to rigorous scientificality in some areas and becoming 
uncritical in applying unvalidated techniques in the clinical field. The need to be 
self-consistent within the broad framework of experimental-statistical science must 
impose very stringent limitations which we have an obligation to live up to. Let us 
not be clinically-split personalities, scientific up to a certain point only. 

Lest the student be tempted because of his own insecurities to identify himself 
with some of the more glamorous but unvalidated credos of our time, we hasten to 
remind him that some of the most respected figures of clinical and applied psychology
have been rigorous scientists whose lasting devotion to basic principles has won 
them more permanent respect than some meteoric theorists whose status rapidly 
fades out at the moment of their most apparent brightness. Of the older generation 
of master clinicians, who achieved higher recognition than Frederick L. Wells, 
Walter V. Bingham, Lewis M. Terman or Grace H. Kent, and yet who were more
modest, unassuming and consistently conservative in not attempting conclusions 
unsupported by valid data? Their claim for lasting respect must rest upon self- 
consistent applications of scientific principles. Clinical psychology as science will 
have its development greatly impeded if we allow ourselves to be distracted by 
specious dialectics and theorizing by latter day prophets who may be very imagina-
tive but are certainly not scientific.

The subject matter of psychological theorizing is so extensive and complex that we
need to have periodic review and evaluation to separate the wheat from the chaff.
The authors are to be congratulated on having accomplished a most unbiassed and 
eclectic evaluation of current theories of personality. 

Calif.: Annual Reviews, Inc., 1957, pp. 502. $7.00


This volume continues in the usual competent tradition of the Annual Reviews 
to present a comprehensive survey of contemporary American psychology. Clinical 
J. J. Jenkins and D. T. Lykken, personality by C. W. Eriksen, abnormalities of be-
havior by M. E. Wright, psychotherapy by C. L. Winder, assessment techniques by

Levine, Edna S. Youth in a Soundless World. New York: New York University

UNIQUENESS


Some children have unique 
needs -- needs beyond the usual 
psychological and psychiatric 
evaluations. With many chil- 
dren it is found that for every 
therapeutic experience in the 
professional office or clinic, 
there are countless other exper- 
iences that are not therapeutic 
-- in a tense or conflictual home, 
in a schoolroom that does not 
meet their needs, or with peers 
with whom they cannot cope. 


It is to serve these children 
that Devereux Schools have de- 
signed various milieu programs 
in which a plan for each child 
can be designed so that he 
learns with teachers of special 
education while he lives in an 
environment organized so as 
not to frustrate or threaten him. 


Professional inquiries should be addressed to John M. Barclay, Director of Development,
or Charles J. Fowler, Registrar, Devereux Schools, Devon, Pa. For the western states,
address Joseph F. Smith, Superintendent, or Keith A. Seaton, Registrar, Devereux
Schools, Santa Barbara, Calif.